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Abstract 


The results of prior research concerning the association between LIFO adoption decisions and their 
potential financial determinants are inconclusive. From the perspective of the theory of technology 
diffusion, the focus of these studies is limited to economic factors/forces from system outsiders, The effects. 
of interactions among system members are ignored. Evidence from this study, which is based on a 
technology-diffusion approach, suggests that the LIFO adoption process is best explained partially by 
interactions: with investors, creditors ‘and public and partially by interactions with other firms. Puture 
research may benefit from incorporating proxy variables for sociological factors such as imitation. 


The association between LIFO. adoption deci- 
sions and their potential determinants has been 
examined extensively in the accounting litera- 
ture under a variety of hypotheses that focus on 
such factors as tax benefits, the agency effects of 
bonuses, the agency effects of debt covenants 
and the impact of political costs (Watts & Zim- 

merman, 1986). The test results of these hypoth- 
eses are mixed. One major reason may account 
for the inconclusiveness: only publicly available 
financial information is used to examine these 
hypotheses. Yet empirical evidence suggests 
that economic reasons alone may be insufficient 
to explain the choice among alternative 
accounting methods (Tritschler, 1970; Shank & 
Copeland, 1973) and that “factors other than 
those examined apparently have influenced the 
LIFO adoption decision” (Hunt, 1985, p. 466). 
The suggested non-economic factors include 
behavioral and sociological variables (Shank & 
Copeland, 1973; Hussein, 198] ). 





One approach to these variables comes from 
the theory of technology diffusion, which 
models the routes by which techhologies are 
spread and adopted through a population. Sev- 
eral researchers have established the usefulness 
of this technology (innovation) diffusion theory 
to the study of accounting choice decisions 
(Tritschter, 1970; Comiskey & Groves, 1972; 
Hicks, 1978; Hussein, 1981). From a technology 
diffusion perspective, a firm may adopt LIFO for 
economic and/or sociological and behavioral 
reasons. The underlying drive may come from 
investors, creditors and public who are not 
directly in charge of the operation ofthe firm, 
yet have interest in, and may benefit from, the 
firm’s economic fitness, i.e. adopting LIFO by the 
firm, the economic demand or welfare of the in- 
vestors, creditors and public is considered. The 


drive may also come from interactions with `` ~ 


other firms and here the focus is social fitness; 
not economic fitness, e.g. by adopting LIFO the 
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~ firm’s inventory policy is consistent with the 


“norm” of the industry to which it belongs.' 
However, none of the above-mentioned studies 
has specifically tested the process models well 
established in the technology diffusion litera- 


‘ture. The purpose of this study is to examine the. 


LIFO adoption decision by using non-financial 


- data for testing three competing technology dif- 


fusion process models: the non-imitation model, 
the imitation model and the generalized model. 
The results should provide additional insights 
into the LIFO adoption process itself. ; 
The first section of this study presents a 
simplified framework for analyzing accounting 
diffusion processes. The next section is a sum- 
mary of prior research. This is followed by a 
description of three basic diffusion process mod- 
els. Later sections address the selection of the 
sample, the results of the tests and a discussion of 
the limitations and implications of the research. 


A SIMPLIFIED FRAMEWORK 


Technology diffusion can be broadly defined 
as the process by which information about a 
technology is communicated through certain 
channels over time among the members of a 
social system (Mahajan & Peterson, 1985). 
Accordingly, there are four major elements in 
the process: the technology, the channels of 
communication, time and the social system. The 
social system is defined as “a collectivity of 
units” that share a common objective (Rogers & 
Shoemaker, 1971).* The members are adopters 
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and potential adopters of the technology. L. line — 
with the technology diffusion literature, they are 
labeled as system insiders. Those who are not 
directly in charge of the operations, such as the 
investors, creditors and public are system out- 
siders.? A technology is an idea, an object or a 
practice that is, in general, perceived as “new” by 
members of the social system. The technology 
need not be a recent invention (Skank & Cope- 
land, 1973; Hussein, 1981); it may have been 
invented and known to members of the social 
system for some time, but the need for its adop- 
tion has not been felt by the non-adopters (Hus- 
sein, 1981). The channels of communication are 
the routes by which information is transmitted 
to or within the social system. They can be classi- 
fied into two major categories: the vertical chan- 
nels that communicate information to the social 
system and the horizontal channels that com- 
municate within the social system. The element 
of time relates to the relative speed at which the 
technology is adopted. 

A simplified framework for analyzing account- 
ing technology diffusion is developed in accor- 
dance with the above descriptions. It is pre- 
sented in Fig. 1. The technology is LIFO. 
Although a well-known inventory method, LIFO 
is “new” to the non-LIFO firms in the sense that 
the managers of ise firms have not felt the 
need for adopting it.‘ The social system consists 
of accounting entities, i.e. firms; some are LIFO 
adopters, some are potential adopters. All of the 
firms have the objective of producing, selling 
products and/or rendering services to their cus- 
tomers and have to choose among inventory val- 


The theory behind economic fitness and social fitness will be discussed in the Technology Diffusion Process Models section. 


The system members can be individuals, business organizations, public organizations or nations depending upon the specific 
diffusion context. The members share a common objective (Rogers & Shoemaker, 1971) or a common cultare (Mahajan & 
Peterson, 1985). In the LIFO adoption context, the social system consists of firms that have the objective of producing, selling 
products and/or rendering services to their customers and have to select an inventory method for external reporting 
purposes. All the COMPUSTAT firms fit this description. 


` 3The investors are the owners of the firm while the managers are “hired” by the investors to operate the firm. The investors 


have the ownership while the managers have the control of the firm. Since the owners are not directly in charge of the 
decisions and operations of the firm, they are labeled as outsiders. 


‘Rogers (1962) states that “it really matters little, 2s far as human behavior is concerned, whether or not an idea is 
“objectively” new as measured by the amount of time elapsed since its first use or discovery. It is the newness of the idea to 
the individual that determines his reaction to it” (p. 13). 
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Outsiders (Investors, 
creditors, public) 


Social system 





Fig. 1. A simplified framework of technology diffusion process. 


uation methods for external reporting purposes. 
The interactions between the non-LIFO firms 
and investors, creditors and public and between 
the non-LIFO firms and LIFO firms may induce 
the potential adopters to adopt LIFO. The boun- 
daries of the adopters and the potential adopters 
are not fixed, i.e. they are represented by dotted 
lines, so that the former may expand while the 
latter may shrink by passage of time. This frame- 
work is used to evaluate some of the prior re- 
search. 


PRIOR RESEARCH 


Prior research is classified into two groups: 
those studies focused on economics and those 
focused on sociological concepts. The econ- 
omics studies focus only on the association 
between LIFO adoption decisions and their 
financial- determinants. The- -sociologically 
informed inquirers attempt to identify. the gen- 
eral non-economic factors that facilitate the dif- 
fusion of accounting technology. 


Economic studies 

Among what are here being classified as econ- 
omic studies, four hypotheses have been pro- 
posed and tested to explain the choice between 
LIFO and other inventory methods. They are the 
tax-benefit hypothesis, the agency theory— 
bonus hypothesis, the agency theory—debt co- 
venant hypothesis and the political-cost hypo- 
thesis. The tax-benefit hypothesis suggests that a 
manager chooses LIFO since his or her firm’s net 
operating cash inflow is increased due to the re- 
duction in tax liability. The manager of a firm 
may decide to stay on FIFO because a switch to 
LIFO may lead to a reduction of shareholders’ 
wealth due to the lower reported income. Con- 
ceptually, tests of this hypothesis may be based 
on the association between LIFO adoption and 
cash flow position. Previous studies focus only 
on the association between LIFO adoption and 
its effect on the firm’s shareholders’ wealth. The 
shareholders’ wealth is measured by the firm’s 
security prices.It is hypothesized that by. adopt- 
ing LIFO, a firm’s security price will be de- 
creased because of the lower reported income. 
The results of tests are inconsistent. Abdel- 
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Khalik & McKeown (1978) and Ricks (1982) re- 
port a negative association between a LIFO 
switch and security prices. Brown (1980) re- 
ports no significant association, while Sunder 
(1973, 1975) and Biddle & Lindahl (1982) re- 
port a positive association. 

The agency theory—bonus hypothesis focuses 
on the manager—shareholder relationship and 
assumes that shareholders of a firm will reward 
the manager if the firm’s accounting earnings 
number is increased, Le. shareholders’ incentive 
plans influence the behavior of managers. This 
hypothesis proposes that a manager selects an 
income-increasing accounting method, e.g. 
FIFO, because his or her income-based bonus 
increases due to the choice. The results of these 
tests are also mixed. Hagerman & Zmijewski 
(1979) found that compensation was not a sig- 
nificant factor for explaining the choice of inven- 
tory method; even the sign of the regression 
coefficient was opposite to that suggested by the 
hypothesis. Abdel-Khalik (1985) found that 
changing to LIFO did not have a significant nega- 
tive effect on compensation. Zmijewski & 
Hagerman (1981) report that the existence of a 
profit-sharing plan influences the manager to 
choose an income-increasing accounting 
strategy, e.g. the manager would choose FIFO. 
The accounting strategy of a firm is defined in 
their work as a combination of four choices. 
Each relates to the choice between-accelerated 


and straight-line depreciation methods, LIFO, 


and FIFO, amortization of past cost in less than 
30 years and greater than or equal to 30 years 
and deferral and flow-through methods for 
investment tax credit, respectively. 

The agency theory—debt covenant hypothesis 
focuses on the bondholder—shareholder rela- 
tionship. It suggests that there is a positive 
relationship between a firm's contractual con- 
Straints and income-increasing accounting 
methods, Payment of dividends and incurrence 
of additional debt are constrained by the firm’s 
lending agreements (Leftwich, 1983). A LIFO 
firm is more constrained than a FIFO firm 
because by choosing LIFO a firm has a lower 
reported income and a lower retained earnings. 
This will lead the firm to pay lower dividends 
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because of the lower reported income. This will 
also make it more difficult for the firm to incur 
additional debt because of the higher debt- 
equity ratio. In general, payment of dividends is 
measured by the ratio of dividends to unre- 
stricted retained earnings, while incurrence of 
additional debt is measured by the ratio of net 
tangible assets to long-term debt. The results of 
these tests are inconclusive. Morse & 
Richardson (1983) found that the two financial 
ratios were not statistically significant in 
explaining choice of inventory method, while 
Hunt (1985) found them to be significant. 

The political-cost hypothesis suggests that a 
firm may avoid the attention caused by high 
reported profit and therefore choose LIFO, due 
to the public’s association of high profit with 
monopoly rents (Watts & Zimmerman, 1978). 
Political cost is usually measured by the size, i.e. 
sales of the firm. The larger the sales ofa firm, the 
higher the probability this firm will choose LIFO 
to report a lower income to avoid public atten- 
tion. The results of these tests are also inconsis- 
tent. Hagerman & Zmijewski (1979) found that 
size is not statistically significant in explaining 
LIFO adoption while Morse & Richardson 
(1983) found just the opposite. i 

In line with. the diffusion framework pre- 
sented in Fig. 1, it may be concluded that these 
studies consider only part of the whole diffusion 
process. Their main focus, as represented in the ~ 
framework, is on the effects of communication 
between firms and system outsiders. The man- 
agements of the potential adopters presumably 
base their decisions on the demands or welfare 
of the investors (tax-benefit hypothesis, agency 
theory—bonus hypothesis), the creditors 
(agency theory—debt covenant hypothesis) or 
the public (political-cost hypothesis). In this 
type of modeling, no consideration is explicitly 
given to the effects of communication among 
firms in the social system, i.e. no sociological fac- 
tors such as imitation are considered. 


Sociological studies 

Several studies explicitly consider behavioral 
and sociological factors among the social system 
members and study the adoption distribution. 


' 
a 
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Tritschler (1970) used mainly aggregated non- 
economic data from published sources to 
examine the adoption pattern of LIFO and accel- 
erated depreciation methods in the US. and 
price-level adjusted financial statements in 
France. His major result was that tax relief alone 
was not sufficient to explain the choice among 
alternative accounting methods. He conjec- 
tured, without tests, that the adoption process 
should be examined from a sociological per- 
spective. 

Copeland & Shank (1971) used a question- 
naire methodology to examine LIFO adoption 
decisions. Their results supported Tritschler’s 
(1970) view that profit maximization alone does 
not explain LIFO adoption decisions, but the 
weight assigned to behavioral factors was not 
significant. Shank & Copeland (1973) also used 
non-economic data from published sources to 
examine the behavior of “change-receptive” 
firms. They claimed that once a firm has learned 
the effects of one set of alternative accounting 
methods, it tends to adjust rapidly to the effects 
of other sets of alternative accounting methods, 
Le. the firm has formed a learning set. Inventory, 
depreciation and investment tax credit methods 
were jointly considered. The results generally 
supported the “learning set” theory, which 
claims that a firm receptive to change shows 
more accounting changes in a given time period 
than a random sample of firms. 

Rogers (1962) hypothesized that the adop- 
tion frequency distribution over time is a bell- 
shaped normal distribution. The cumulative 
adoption time path, i.e. a plotting of cumulative 
number of adopters against time, is an S-shaped 
curve. Comiskey & Groves (1972) used a ques- 
tionnaire methodology to study the adoption 
distributions of their sample firms. Their focus 


’, was on the frequency distributions of innova- 
` tions, rather than the process models. The 


results showed that the distributions were not 


` normal. Any unimodal frequency distribution, 


however, will generate an S-shaped cumulative 
adoption time path. Comiskey & Groves’ results 
may imply that the frequency distribution, 
although not normal, probably is still unimodal. 

Hicks (1978) also used questionnaires to 
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study the perceptions toward new accounting 
technologies by accounting interest groups. 
One major finding was that system effects are 
important factors in the diffusion process. Sys- 
tem effects are “the influence of the norms, val- 
ues, and functions of a group on the behavior of 
the individual members of that group” (Hicks, 
1978, p. 373). There is a significant relationship 
between an individual’s perception of a technol- 
ogy and his or her membership in an interest 
group. He finds that accounting academicians 
have a higher need for changes than CPAs and 
financial executives who belong to other in- 
terest groups. This concept can be used to high- 
light the importance of the communication be- 
tween firms presented in Fig. 1. The other major 
finding was that new technologies often fail to 
gain adoption by the accounting profession 
because practitioners perceive the need for 
change differently from academicians. This may 
partially explain why many firms still have not 
adopted LIFO even though it is an existing and 
well-known technology. 

Hussein (1981) studied the diffusion process 
for accounting standards using a questionnaire 
methodology. The questionnaires were sent to 
members of the FASB Advisory Council and of 
the FASB’s sponsoring organizations to study the 
association between perception, communica- 
tion channels and nine accounting standards. 
Hussein’s major contribution is the establish- 
ment of a conceptual framework for analyzing 
this process. In Hussein’s framework, he specifi- 
cally recognized both the vertical and the 
horizontal channels of communication. The em- 
pirical results showed that financial accounting 
standard setting is indeed a two-dimensional dif- 
fusion process. 

To date, most sociological oriented studies 
have used a questionnaire methodology to 
examine aspects of the accounting technology 
diffusion processes. Some support the diffusion 
theory and all recognize the importance of non- 
economic factors. Hussein (1981) acknow- 
ledged that “survey data are mostly useful in 
studies of a static nature” (p. 36) and suggested 
that other methodologies should be useful for 
studying. processes. In addition, none of these 
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studies specifically tests the three alternative 
process models proposed in the diffusion litera- 
ture. The next section discusses these process 
models. ' 


TECHNOLOGY DIFFUSION PROCESS MODELS 


Three basic models have been developed to 
study the process of technology adoption. They 
are the non-imitation model (Fourt & Wood- 
lock, 1960; Coleman, 1964), the imitation 
model (Dodd, 1958; Mansfield, 1968) and the 
generalized model (Rogers & Shoemaker, 1971; 
Mahajan & Schoeman, 1977). These three 
models serve as competing explanations of the 
diffusion process. The non-imitation model (also 
called the external-influence model) assumes 
that the sources of the diffusion process are ex- 
ternal to the social system. There is.no imitation, 
Le. there is no interaction among system mem- 
bers. The imitation model (also called the inter- 
nal-influence model) assumes that all adopters 
(except the first one, who is an innovator) are 
imitators, i.e. an adopter adopts a technology 
only after observing another adopter using the 
technology. The generalized model (also called 
the mixed-influence model) is a combination of 
the above two models. It assumes that the adop- 
tion decision is made partially through external 
sources and partially through imitation. 

It is also useful to examine these models from 
a competitive or institutional isomorphism per- 
spective (DiMaggio & Powell, 1983; Dirsmith, 
1986). DiMaggio & Powell (1983) analyzed the 
homogeneity of organizational forms and prac- 
tices. They maintained that there are two types 
of isomorphism: competitive and institutional. 
The competitive isomorphism concept em- 
phasizes market competition. It suggests that the 
change in organizational forms and practices is 
driven by competition and the need for effi- 
ciency and economic fitness; social fitness is not 
considered. This concept is consistent with the 

‘non-imitation model. DiMaggio &_ Powell 
(1983), however, stated that competitive 
isomorphism “may apply to early adoption ofin- 
novation, but does not present a fully adequate 
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picture of the modern world of organizations” 
(p. 150). They suggested that organizations ag- 
gressively compete for political power and in- 
stitutional legitimacy, in addition to resources 
and customers, therefore institutional 
isomorphism is introduced as an additionally 
useful concept. 

The institutional isomorphism concept 
emphasizes social fitness, i.e. organizations tend 
to become isomorphic despite the differences in 
underlying technical operations. Institutional 
isomorphic change occurs through three 
mechanisms: coercive, mimetic and normative. 
Coercive isomorphism results from external 
constituents’ pressures, mainly from govern- 
ment mandate, to adopt new practices. Mimetic 
isomorphism results from organizational copy- 
ing, ie. organizations model themselves after 
their peers that are perceived as more success- 
ful. Normative isomorphism results from profes- 
sionalization, i.e. the collective struggle of the 
members of a profession to establish the profes- 
sion’s legitimacy and to define the conditions 
and methods of its members’ works. It is 
achieved through formal education and profes- 
sional networks. The imitation model discussed 
before is generally supported by the concepts of 
mimetic and normative isomorphism. The 
generalized model basically fits the descriptions 
of both competitive and institutional isomor- 
phism. 

The study of LIFO adoption decision in the 
technology diffusion framework has been 
suggested by Copeland & Shank (1971) and 
Tritschler (1970). In the LIFO adoption decision 
context, the LIFO method is the technology. The 
non-imitation model claims that the adoption of 
LIFO is induced (or forced) mainly by interac- 
tion with the investors, creditors and public. The 
manager of a non-LIFO firm may adopt LIFO for 
economic fitness reasons, for example, he or she 
feels pressure from the investors ta improve the 
firm’s cash flow position. Social fitness, e.g. other 
firms’ selection of inventory methods, has no 


_influence on the decision. The imitation model 


claims that the diffusion process is mainly 
caused by imitation; therefore, a firm adopts 
LIFO with the rationality of mimetic or norma- 
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tive isomorphism. The manager of a non-LIFO 
firm may adopt LIFO simply because other firms 
in the social system choose LIFO. The demand or 
welfare of the investors, creditors and the public 
is not the major concern. The generalized model 
claims that the diffusion process can be 
explained partially by external factors and par- 
tially by imitation, i.e. with the rationality of 
competitive and institutional isomorphism. 

If the number (population) of potential adop- 
ters is fixed,’ then the three models can be 
specified as follows: 

(1) non-imitation model: 

Si = beta, (N —k;), 
(2) imitation model: 
Se = betaye, (~—*! 
(3) generalized model: 


dk, 


EH = beta, (N — k,) + betak, (Ay, G) 


where N is the number of potential adopters, k, 
is the cumulative number of adopters at time ¢,° 
beta, is an index of uninfluenced adoption and 
beta, is an index of influenced adoption (by 
imitation).’ 

The non-imitation model assumes that the dif- 
fusion process proceeds from an outside source, 
independent of the number of firms that have 
already adopted LIFO. The impact of the exter- 
nal source on non-LIFO firms is represented by 
beta,. The number of new LIFO adopters per 
unit time is proportional to the total number of 
firms that have not yet adopted LIFO. The imita- 
tion model assumes that firms are able to imitate. 


a) 





) and (2) 
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The LIFO adoption rate is dependent not only on 
the number of firms that have adopted LIFO but 
also on the proportion of the maximum number 
of potential adopters which are still using a non- 
LIFO method. The term k, (N — k,) represents 
interaction between the adopters and the poten- 
tial adopters at time t. The generalized model is 
a combination of the above two models. The sol- 
utions for equations (1), (2) and (3) are: 
(1) non-imitation model: 
k,=N— exp(1n(N— &o) — beta; (t— to)), (4) 
(2) imitation model: 











Nexp(in(*—) + beta, (t — fo)) 
k= Z Bo y and (5) 
1 earms 
exp( NSE ) + beta, (t -t%)) +1 
(3) generalized model: (6) 
kabeta, + Nbeta, 
Nesp ti a) tU) beta + beta,)) ~ beta, 
ate b. beta, + Mbeta; ' 
Se pate ar (fg) (beta, + beta,)) + beta, 


where k is the cumulative number of adopters 
at time Zo. 


The cumulative adoption time paths (&,) gen- 
erated by the three models, although S-shaped, 
are not identical because of the assumptions 
made, i.e. adoption is caused by interacting only 
with system outsiders, only with other firms or 
both. A hypothetical example is provided in Fig. 
2 to highlight the behavioral patterns. In this 
example, the number of adopters at t is one, the 
total number of potential adopters is 100, the 
impact of uninfluenced adoption (beta, ) is 0.01 
while the impact of influenced adoption (beta) 
is 0.2. Figure 2 shows the 20-year patterns of 


*The process models are fixed population models. One requirement is that the number of potential adopters, N, is fixed and 
known. Although sample firms are drawn from publicly-held companies listed in the COMPUSTATF tape, some publicly-held 
firms are not listed and their inventory methods are therefore not known. Some of the listed firms’ inventory methods are not 
disclosed on the tape. Given these constraints, this study is more exploratory than conclusive. Other limitations are also 


“" discussed in the Conclusion section. 


“When applying the three basic process models, a global assumption is that “all relevant information has been captured by 
the models”, usually through the term &, (Mahajan & Peterson, 1985, p. 25). This is perhaps a too-strong assumption. 
However, these models are basic and can be used as the starting points for studying diffusion processes. 


"The nature of the indices is very similar to that of the regression coefficients of a regression equation. The values of the 
regression coefficients can be negative, zero or positive. The values of the two indices in the technology diffusion models, 
however, are cither zero or positive. The reason is that the models consider only the adoption process, not the abandonment 
process, Le. the cumulative number of adopters can only stay the same or increase over time. 
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Fig 2. An example. — Model 1, + model 2, * model 3. 


adoption for the non-imitation model (model 1), 
the imitation model (model 2) and the 
generalized model (model 3). The adoption 
patterns are different by visual inspection. 
Coleman ( 1964) has illustrated the usefulness 
of these idealized models by investigating the 
process of the adoption of a drug by doctors in 
four midwestern communities. He has found 
that the non-imitation model best describes the 
adoption ‘behavior of the “isolated” doctors, 
while the imitation model best describes the 
adoption behavior of the “integrated” doctors. 
The important factor is a doctor’s integration or 


isolation in the local medical community, as- 


measured by the number of times other doctors 
named him or her as a social friend or as a fre- 
quent fellow-discussant of medical cases.* The 
“isolated” doctors depend on advertising, the 





BEN-HSIEN BAO and DA-HSIEN BAO 


drug. salesmen and occasionaily informed 
patients (outsiders ) to make their decisions. The 
“integrated” doctors are in the social position to 
discuss with and learn from other doctors 


‘(insiders) and adopt the drug. In general, the 


non-imitation model is appropriate when system 
members are “isolated” or when information 
about the technology is only available from an 
external source. The imitation model is approp- 
riate “when a technology is complex and socially 
viable, not adopting it places system members at 
a comparative disadvantage" (Makajan & Peter- 
son, 1985, p.18). 

The LIFO adoption process and the drug adop- 
tion process are similar in many respects. Each 
has a technology (LIFO vs drug), social system 
insiders (other firms vs other dactors), social 
system outsiders (investors, creditors, public vs 
advertising, drug salesmen, patients), interac- 
tions between non-adopters and adopters and 
between non-adopters and outsiders through 
communication channels and adoption rates. In 
the LIFO adoption context, the focus, however, 
is not on the degree of integration or isolation, 
ie. itis not on the behavioral difference between 
the “isolated” firms and the “integrated” firms 
since the degree of integration or isolation ‘of a 
firm is not easily measurable and both types of 
firms are potential adopters. The focus is on the 
main determinants or drives of the adoption pro- 
cess of all the potential adopters, i.e. whether 
the managers of the non-LIFO firms make their 
adoption decisions by mainly considering the 


‘demand or welfare of and being influenced by 


the outsiders, by mainly considering the inven- 
tory method decisions of the adopting firms or . 
by considering both. , 
Realistically, the non-imitation model and the 
imitation model are limited and may not be 
entirely satisfactory for explaining accounting 
method choice by all the non-adopters while 
the generalized model is more appropriate. 
There are two plausible reasons for the 
generalized model’s appropriateness. Following 
Coleman’s (1964) rationale, the non-adopters 


®according to Coleman (1964), isolation does not necessarily imply physical isolation. The important element is whether a 


doctor maintains communication with other doctors. 
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are either “isolated” firms or “integrated” firms 
and their adoption process can best be des- 
cribed by the non-imitation model and the 
imitation model, respectively. The adoption pro- 
cess of the combined group, i.e. all the non-LIFO 
firms, therefore probably can only be described 
by the generalized model. The other reason is 
that due to the advances in communication, the 
manager of a firm does have more opportunities 
to discuss the firm’s decisions with outsiders, 
such as with the investors through shareholders’ 
meetings and to learn the advantages ofadopting 
a specific accounting method through profes- 
sional association meetings and publications and 
personal contacts. The second rationale is prob- 
ably closer to the reality and more consistent 
` with DiMaggio & Powell's (1983) isomorphism 
concepts. But even with the opportunities of 
interacting with both the outsiders and the insid- 
ers, the manager of a non-LIFO firm may still 
make his or her LIFO adoption decision for the 
firm by mainly considering the outsiders’ 
demand or welfare (the non-imitation model), 
by mainly considering other firms’ choices (the 


imitation model) or by considering both (the 


generalized model). 


Accordingly, the 
hypotheses are made. 


following competing 


Hypothesis 1. 

Non-LIFO firms adopt LIFO mainly after their managers’ 
consideration of the demand or welfare of the system 
outsiders (investors, creditors and public). Although the 
managers have opportunities to communicate with and 
learn from the managers of LIFO firms, this type of in- 
teraction has no significant influence on the LIFO adop- 
tion process, 
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Hypothesis 2. 
Non-:LIFO firms adopt LIFO mainly after their managers 
communicate with and learn from the managers of LIFO 
firms: Although the managers have opportunities to con- 
sider the demand or welfare of the system outsiders (in- 
vestors, creditors and public), this type of interaction has 
no significant influence on the LIFO adoption decision.- 


Hypotbests 3. 

Non-LIFO firms adopt LIFO after their managers consider 
the demand or welfare of the system outsiders (investors, 
creditors and public) and after their managers communi- 
cate with and learn from the managers of LIFO firms. 


As stated earlier, hypotheses 1 and 2 are res- 
trictive and expected to be rejected while hypo- 
thesis 3 is more realistic in the LIFO context na 
is not expected to be rejected. 


SAMPLE AND ANALYSES 


In order to examine the LIFO adoption.pro- 
cess in the 1965—1984 period, sample firms 
were selected from the COMPUSTAT tape.” The 
“split-half approach was adopted and four accu- 
racy-of-forecast tests were performed to com-' 
pare the process models. 


. Sample and 1965—1984 analyses 


A firm was included in the sample if both of 
the following criteria were met. 

(1) the firm existed during the entire 1965— 
1984 period; those firms that merged, bank- 
rupted or were newly incorporated after 1965 
are excluded. Merger is defined as a major 
merger.'° 

(2) The COMPUSTAT data item number 59 
clearly identifies the firm’s inventory valuation 


For accounting purpose, a firm reports the use of LIPO under two situations: (1) it chooses LIFO for tax purpose; it then must 
choose LIFO for accounting reporting purpose under the conformity rule of the IRS; and (2) it chooses non-LIFO for tax 
purpose but decides to choose LIPO for accounting reporting purpose. The former may be explained by the tax-benefit 
hypothesis, i.e. the adoption decision is influenced only by a tax factor. The latter probably is influenced by interactions with 
system members and by external factors other than tax. Both situations can be explained by the technology diffusion theory. 

The focus of this study is the LIFO adoption process for accounting reporting purpose and the samples are selected from the 
COMPUSTAT tape. The inventory methods chosen by firms for tax purposes are not available. Even if they are known and if 
most of the firms adopt LIFO for tax reasons (therefore using LIFO for accounting reporting), the results should be “biased” 
in favor of hypothesis 1, Le. the decision is influenced by outsiders such as investors and the IRS. The results of this study, 
however, suggest that non-economic factors are also important, Le. hypothesis 1 is rejected while hypothesis 3 is not. 


10A major merger is identified by COMPUSTAT footnote item number 1 code AB and its related codes. 
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TABLE 1. Cumulative number of LIFO adopters this firm was not classified as a LIFO firm if it 
Year’ Observed ‘switched to other inventory methods in a later 








year.” The size of the sample is 623 and the 





1965 8 
1966 ` 8 cumulative numbers of LIFO firms for each of 
1967 8 the years are reported in Table 1.14 
1968 8 The “split-half? approach requires the time 
1969 8 period to be divided into two subperiods. The 
sr i first is an estimation period while the second isa 
1972 10 forecasting period. The estimation period in this 
1973 12 section is from 1965 to 1977 and the forecasting 
1974 31 period is from 1978 to 1984.'5 
rA 33 The 1965—1977 data are used to estimate the 
ion a parameter values for the non-imitation model 
1978 6 (model 1), the imitation model (model 2) and 
1979 56 the generalized model (model 4). Although 
1980 65 equations (4) and (5) for models 1 and 2 are 
1981 70 non-linear equations, linear transformations can 
mie ts be done and beta, and beta, are estimated by a 
1984 82 linear regression procedure. Equation (6) for 
model 3 is also a non-linear equation but here no 
Sample size = 623. linear transformation is possible. The par- 
ameters are estimated by the hill-climbing pro- 
method from 1965 to 1984." cedure described in Kalwani & Morrison 


A firm in the sample was classified asa LIFO (1977). The estimated parameter values are re- 
firm in a particular year if all its inventories were ported in Table 2. t- and F-statistics and R? values 
valued on the LIFO basis in that year.'? However, for models 1 and 2 are also presented.'® The re- 


rae 
"The sample size is reduced to 2300 by criterion (1). It is further reduced to 623 by criterion (2) mainly beczuse many firms’ 
inventory methods prior to 1973 are not disclosed on the tape. The final sample size is still significantly larger than those in 
previous studies. 


"There are two other LIFO classification schemes: “any LIFO” and “predominantly, LIFO”. “Any LIFO” means that a firm is 
classified as a LIFO firm in a particular year even if only a small portion of its inventory is valued on the LIFO basis in that year. 
This scheme is not adopted since a single-digit code was used in COMPUSTAT for years prior to 1972. A major classification 
problem can be illustrated by the following example: in 1971, a firm used FIFO for product A, which had 80% of the total 
inventory value and LIFO for product B. The single-digit code did not disclose the use of LIPO for product B. “Predominantly 
LIFO” means that a firm is classified as a LIFO firm in a particular year if the majority of its inventory is valued on the LIFO ` 
basis, Le. the first digit of the code is a“2”, in that year. The major problem is illustrated as follows: a firm used LFO for product 
A and FIFO for product B in 1972 and 1973. It reported 55% LIFO and 45% FIFO at the end of 1972. During 1973 purchases 
for both products were the same but product A was sold at a faster rate. At the end of 1973 the firm reportec 40% LIFO and 
60% FIFO. The firm is classified as a LIFO firm only for 1972 even though the firm used inventory methods “consistently”. 
The classification scheme used in this study is restrictive and is not perfect, but should be more reliable than the alternatives. 


'3Since the described diffusion models do not specifically deal with “switch-backs”, those firms are not classified as LIFO firms. 


“The table shows the cumulative number of adopters in cach year, Le. the number of firms.in the sample, up to that year, 
adopted LIFO for all of its inventories, The numbers seem to be low because the “all-LIFO” criterion was used due to the 
COMPUSTAT coding problem discussed before. 


‘The 1965—1984 period is divided into two subperiods, the first twice as long as the second. This provides a longer 
estimation period and hence more accurate estimates, while the forecasting period remains long enough for calculating 
forecasting statistics. 


‘all the R? values reported are adjusted R? values. 
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TABLE 2. Estimated parameter values (1965-1977) 











Parameter Model 1 ‘Model 2 Model 3 
Beta, 0.004662 0.000001 
(t=5.51)* 
Beta, 0.158113 0.13 
(t= 6.29)* 
F 30.31* 39.57° 
R 0.73 0.78 





* Significant at alpha = 0.01. 


gression models are useful since the t- and F- 
statistics are significant and R? values are high. 
No regression statistics are reported for model 3 
since the coefficients are not estimated through 
a regression procedure. 

The estimated parameter values reported in 
Table 2 are used in equations (4), (5) and (6) to 
generate the predicted number of LIFO firms in 
the 1978-1984 period, ie. the forecasting 
period. The predicted numbers are reported at 
the top of Table 3. The closeness of the predic- 
tion (to actual observation) is examined by four 


accuracy-of-forecast tests: the chi-square value, 
the mean-square error, the regression measures 
and Theil’s U (Maddala, 1977).'’ The chi-square 
value tests the closeness between the forecasted 
and the actual numbers of LIFO firms. An insig- 
nificant chi-square value means that the predic- 
tion of that model is accurate. The mean-square 
error test is selected since it “seems to be the 
most popular” among mean-square error, mean 
absolute error and minimum number of errors in 
the prediction of turning points (Kennedy, 
1979, p. .149). The model with the smallest 


TABLE 3. Accuracy of forecast tests (1978—1984) 








Predicted 

Year Observed Model 1 Model 2 Model 3 
1978 46° 44.17 57.46 41.03 
1979 56 46.86 66.25 46.30 
1980 65 49.54 76.22 52.19 
1981 70 52.21 87.44 58.75 
1982 72 54.86 100.01 66.04 
1983 75 57.50 114.01 74.12 
1984 82 60.13 129.48 83.04 
Chi-square 31.38 47.61 8.49* 
(d£ = 4) : 
Mean-square error 0.067 0.169 0.022 
Regression measures: 

intercept —40.656. 26.580 20.629 

regression coefficient 2.055 0.444 "0,763 

(#=9.115) (t= 6.683) (t= 6.817) 

Theil's U 0.0036 0.0058 0.0021 





* Not significant at alpha = 0.01. 





“The chi-square value should be used as the primary criterion to judge forecasting accuracy because its significance can be 
tested. The other three tests are supplementary. There is no statistical test of significance for the mean-square error or Theil’s 
U. The results of the regression measures are weak in this study. 
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mean-square error is the best model (Maddala, 
1977, p. 344). A regression measure test that 
regresses actual values on predicted values is 
suggested by Mincer & Zarnowitz (1969). This 
is also a “popular” method and “if the intercept 
estimate tests insignificantly different from zero 
and the slope coefficient tests insignificantly dif- 
ferent from one” the model is said to be a good 
one (Kennedy, 1979, p. 156). Theil’s U is 
another popular test (Maddala, 1977, p. 346). It 
is zero in the case of perfect forecasts. The model 
with the smallest U is classified as the best 
model. 

The chi-square values for comparing actual 
and predicted numbers of LIFO firms in the 
1978-1984 period are presented in Table 3. Of 
the three models, only the third predicted num- 
bers of LIFO firms that were not significantly dif- 
ferent from the observed numbers. On this 
count, model 3 is the best model. Furthermore, 
Table 3 indicates that model 3 has the smallest 
mean-square error and Theil’s U statistic and is, 
therefore, the best model according to these 
criteria as well. The intercepts of regression 
measures for all models are different from zero. 
The deviation of model 3’s regression coefficient 
from one, however, is the smallest, Le. 24% vs 
106% for model 1 and 56% for model 2. The 
results given in Table 3 imply that the 
generalized model best explains LIFO-adoption 
decisions; hypotheses 1 and 2 are rejected while 
hypothesis 3 is not. 


Subperiod analyses 
Different firms may adopt LIFO in different 
years for a variety of reasons, therefore, the valid- 
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ity of the three diffusion process models can 
only be evaluated for a particular time period. 
According to the results of the last section, the 
LIFO adoption process in the 1965—1984 time 
period conforms most closely to model 3. If this 
time period is subdivided, then the validity of 
these three models has to be re-evaluated in 
accordance with the shorter time periods. ` 
The American Institute of Certified Public 
Accountants (1975) and Abdel-Khalik (1985) 
reported a significant increase in LIFO firms in 
1974. It is possible that the LIFO adoption pro- 
cess in the 1965—1973 period is entirely differ- 
ent from that of the 1974—1984 period, i.e. LIFO 
adoption decisions may be induced by different 


- forces during the two time periods. This can be 


stated in the null form of a hypothesis. 


Hypothesis 4, 
LIFO adoption process in the 1965—1973 period is the 
same as the process in the 1974-1984 period. 


The sample is divided into two periods: 1965— 
1973 and 1974-1984. The 1965—1973 data and 
equations (4), (5) and (6) are respectively com- 
bined to estimate parameter values. The esti- 
mated values are reported in Table 4. The avail- 
able regression statistics for models 1 and 2 
indicate that the estimation results are useful, i.e. 
t- and F-statistics are significant while R? values 
are high. 

The estimated parameter values then are used 
to forecast the number of LIFO firms in the 
1974—1984 period. The accuracy tests are also 
performed and the results are presented in Table 
5. The chi-square values are significant. The 
mean-square errors, regression measures and 


TABLE 4. Estimated parameter values (1965—1973) 


Parameter Model 1 
Beta, 0.000706 
(t= 3.74)" 
Beta, 
F 14.01* 
R 0.67 


Modei 2 Model 3 
0.0001 
0.046333 0.03 
(t= 3.87)° 
14.94° 
0.68 
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TABLE 5. Accuracy of forecast tests (1974-1984) 














Predicted 
Year Observed Model 1 Model 2 Model 3 
1974 l 31 11.90 12.06 11.07 
,1975 i 33 12.33 12.62 11.46 
1976 37 12.76 13.21 11.87 
1977 39 13.19 13.82 12.28 
1978 46 13.62 14.46 12.71 
1979 56 14.05 15.13 13.15 
1980 ` 65 14.48 15.83 13.61 
1981 70 14.91 16.56 14.07 
1982 72 15.34 17.32 14.55 
1983 75 15.77 18.12 15.05 
1984 82 16.20 18.95 15.56 
Chi-square 1442.67 1217.04 1580.26 
(d£ =8) 
Mean-square error 0.663 0.625 0.691 
Regression measures: 
intercept —125.735 —67.520 —107.94 
regression coefficient 12.869 8.026 12.34 
(t= 18.029) (t= 17.161) (t= 17.760) 
Thell’s U 0.0136 0.0132 0.0139 











Theil’s U's all suggest that the adoption proces- 
ses in the two periods are significantly different. 
However, an outright rejection of hypothesis 4 is 
inappropriate. The subdivision of the 1965— 
1984 period into the estimation and forecasting 
periods is “forced” by the observation that there 
is a significant increase in LIFO firms in 1974. 
The estimation period, therefore, is shorter than 
the. forecasting period; the length of the estima- 
tion period is considerably shorter than twice of 
that of the forecasting period. This may have 
effects on the accuracy of parameter estimates 
and subsequently bias the forecasting results. 
An alternative and indirect approach for test- 


ing hypothesis 4 is the independent examina- 
tions of the subperiods. The rationale is that if 
the data from the subperiods are supported by 
different technology diffusion models then the 
adoption processes in these subperiods are 
different. The conclusion is inconsistent with 
the previous results and the validity of the previ- 
ous results should be questioned. If the data 
from the two subperiods are supported by the 
generalized model, ie. the best model for the 
1965—1984 period, then the explanatory factors 
are consistent in these subperiods. However, if 
the data from the two subperiods are supported 
by a technology diffusion model which is not the 


TABLE 6. Estimated parameter values ( 1965—1969) 





Parameter Model 1 
Beta, 0 
(t= 1.00) 
Beta, 
F 0 
R? 1 





Model 2 Model 3 
0.00001 
0.0000* 0.00013 
(t= 1.00) 
0 





* A very small positive number. 
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TABLE 7. Accuracy of forecast tests (1970~1973) 

















Predicted 

Year Observed Model 1 Model 2 Model 3 
1970 8 8 8 8.04 
1971 10 8 8 8.04 
1972 10 8 8 8.05 
1973 12 8 8 8.06 
Chi-square 3° 3° 2.87° 
(df=1) 
Mean-square error 0.088 0.088 0.084 
Regression measures: 

intercept —1.92E16 —6.39E16 —1.33E3 

regression coefficient 2.40E15 7.99E15 1.67E2 

(f=2) (t= 4.270) (t= 39.38) 

Theil's U 0.0276 - 0.0276 0.0271 


* Not significant at alpha = 0.01. 


generalized model then the test results are 
inconsistent and useless. 

The 1965—1973 data are divided into the two 
subperiods, 1965—1969 and 1970-1973. The 
1965-1969 data are used to estimate parameter 
values, reported in Table 6, for forecasting the 
number of LIFO firms in the 1970-1973 
period.!® Table 7 summarized the results of 
accuracy-of-forecast tests. It shows that any one 
- of the models predicts well, i.e. a non-significant 
chi-square value, a small mean-square error and 
a small Thiel’s U. However, the results of accu- 


racy tests may be of questionable value since the 
t- and F-statistics from the estimation period for 
models 1 and 2, reported in Table 6, are not sig- 
nificant. Nonetheless, the generalized model is 
the best model since its statistics from the fore- 
casting period are the smallest among the three 
models. 

The 1974—1984 data are also divided into two 
subperiods: 1974-1979 and 1980—1984. The 
1974-1979 data are used to estimate parameter 
values. The results, presented in Table 8, are 
used to predict the number of LIFO firms in the 


TABLE 8. Estimated parameter values (1974—1979) 











Parameter Model 1 Model 2 Model 3 
Beta, 0.008171 0.0090 
(t= 6.33) 
Beta, 0.122625 0.0002 
(¢=8.77)* 
F 40.09° 76.99° 
E 0.91 095 





* Significant at alpha = 0.01. 


'8In order to perform the chi-square test, a degree of freedom (d£) of at least one should be obtained. This causes the 
estimation period to be, although longer, less than twice of the forecasting period. The results, therefore, are weaker than 


those for the entire 1965—1984 period. 
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TABLE 9. Accuracy of forecast tests (1980-1984) 








Predicted 
Year Observed Model 1 Model 2 Model 3 
1980 65 59.32 61.38 62.17 
1981 70 63.91 68.51 67.21 
1982 72 68.46 76.35 72.19 
1983 75 72.97 84.95 77.15 
1984 82 77.45 94.35 82.05 
Chi-square 1.629° '3.277* 0.305* 
(df= 2) 
Mean-square error 0.004 0.013 0.000 
Regression measures: : 
‘intercept: 13.943 36.189 16.215 
regression coefficient 0.860 0.475 0.784 
= 8,150) (t= 9.325) (t= 0.096) 
Theil's U 0.0008 0.0015 ` 0.0003 





* Not significant at alpha = 0.01. 


1980—1984 period. Accuracy-of-forecast tests, 
reported in Table 9, show that the generalized 
model predicts best. Based on Tables 7 and 9, 
both the 1965-1973 LIFO adoption process and 
the 1974-1984 process can also best be 
explained by a generalized model. This confirms 
the results from the 1965—1984 process: the 
LIFO adoption process can best be represented 
by considering the effects of communication 
both with outsiders and with other firms that 
have already adopted LIFO. 


CONCLUSION 


The findings of this research should be con- 
sidered within the content of its limitations. The 
first limitation is that the specified diffusion 
models are fixed-population models. The sample 
of this study is drawn from the COMPUSTAT 
tape which includes firms that trade their stocks 
on the major exchanges. The firms included in 
any one year are different from the firms 
included in another year. The reasons for exclu- 
sion are numerous. If a firm is not included in the 
tape for any one year then it is deleted. Ban- 
krupted and newly incorporated firms, there- 
fore, cannot be included in the sample and the 


dynamic aspect of growth or decline of the 
population is ignored. This problem limits the 
scope of this study,. Sharif & Ramanathan (1984) 
have developed several dynamic models by 
changing the variable N in equations (1), (2) and 
(3) to N, Le. the population is a prespecified and 
well-behaved algebraic function of time. Specifi- 
cation is difficult and mathematical tractability is 
low even if a well-behaved function can be 
found. l 

The second limitation is that the diffusion 
models focus only on the adoption process, 
ignoring the “abandonment” process. A firm may 
decide to switch from LIFO to other inventory 
methods. This problem may not be serious since 
Lindahl (1986) claimed that the number of 
“switch-backs” is considerably smaller than the 
number of LIFO adopters. 

Despite these limitations, the results of this 
study do indicate that: (1) the LIFO adoption 
decision process in the 1965—1984 period can 
best be explained partially by external sources 
and partially by imitation; and (2) the adoption 
processes in the 1965-1973 period and in the 
1974—1984 period can also best be explained by 
the joint influence of external factors and imita- 
tion; the explanatory factors for the two sub- 
periods are the same although the “weighting” of 
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the factors may differ. During the period as a. 
whole, LIFO adoption decisions were made 
based on the demand or welfare of the investors, 
creditors and public, i.e. after the economic con- 
sequences to the outsiders were evaluated and 
on the managers’ communication and profes- 
sional networks among themselves. This study 
empirically establishes the importance of non- 
economic factors for explaining LIFO adoption 
from a technology diffusion perspective. 
Several general observations can be offered to 
conclude this study. Numerous studies have 
been done to explain the choice of accounting 
methods by firms. Some focused on the diversity 


of choices, Le. why some firms chose a method - 


while other firms chose another method. Others 
focused on, the similarity of choices, i.e. why 
many firms chose the same accounting method 
for reporting purposes. Most of the studies, how- 
ever, focused on accounting choices per se. No 
attention was paid to the organizational and 
social environment in which accounting func- 
tions. 

Accounting does not exist in vacuum. It is a 
tool used by firms to plan and control their oper- 
ations and to communicate the results of opera- 
tions. It is a vital component of organizations 
with various structures and behaviors. Although 
firms may differ in structure and behavior, Di- 
Maggio & Powell (1983) argued that there is 
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also a tendency towards homogeneity in organi- 
zational forms and practices because of connec- 
tedness and structual equivalence. i.e. firms are 
tied to each other through transactions or net- 
work structures. The homogeneity in practices, 
e.g. accounting choices, may be driven by the 
reasons of economic fitness, i.e. the economic 
fitness of the firm can be improved and the need 
of its constituents can be fulfilled ifa firm adopts 
a particular accounting method. The 
homogeneity may also be driven by the reasons 
of social fitness, i.e. a firm is more socially fit if it 
adopts an accounting method which is used by a 
more successful firm or the professional mana- 
gers choose the same accounting method to 
legitimize their occupational autonomy. 

The economic fitness argument has been 
tested by solely examining the relationship 
between financial data and accounting choices. 
The results are inconclusive mainly because the 
social fitness argument was entirely ignored. 
The major argument of this study is that in order 
to understand accounting choices in general, 
and the LIFO adoption decision in particular, 
both the economic and the social reasons should. 
be considered. Evidence based on non-financial 
data and technology diffusion models basically 
supports this argument. There is much to be 
gained if future research on accouating choices 
can take the sociological factors into account. 
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Abstract 


Ongoing research into the impact of budget emphasis on the relationship between participation and 
managerial performance has failed to provide consistent results. The purpose of this study was to conduct 
a replication of this two-way interaction model, drawing on data from a range of organizations in order to 
promote result generalizability and to test whether the model fits the data. It is proposed that a potential 
reason for why the results of this study conflict with those of prior research is that their findings were based 


on non-random sampling. 


The recent paper by Brownell & Hirst (1986) 
has been the latest step in a series of studies 
addressing the question of whether or not 
superior managers’ evaluation styles have an im- 
pact on the relationship between participation 
_and subordinate manager performance. The 
sequence commenced with Hopwood (1972) 
speculating in his examination of cost center 
managers that evaluation styles focusing on ac- 
counting measures would have a negative effect 
on managerial performance. Otley (1978), in a 
study based on profit center managers could not 
corroborate Hopwood’s supposition, since his 
results suggested that a leadership style em- 
phasizing budget attainment was related to grea- 
ter levels of performance. Brownell (1982), in 
an effort to reconcile the Hopwood (1972) and 
Otley (1978) results, found that a high budget 
emphasis is effective in terms of managerial per- 
formance under conditions of high budgetary 
participation but not under low participation. 


_Brownell’s results were based on responses from. 





cost center managers drawn from one U.S. manu- 
facturing company. Finally, Brownell & Hirst 
(1986) attempted to reconcile the conflicting 
results in a study predicated on cost and revenue 





center managers’ responses drawn from one 


manufacturing organization in Australia. How-. 
ever, their results did not support the proposi- 
tion of there being a: significant three-way 
interaction between budget emphasis, budget- 
ary participation and task uncertainty affecting 
managerial performance. Neither could the 
study confirm Brownell’s earlier two-way in- 
teraction finding. 

The purpose of this study was to attempt to 
resolve at least partially the conflict between the 
Brownell (1982) and the Brownell & Hirst 
(1986) results by conducting a replication of the 


: Brownell two-way interaction model, examin- 


ing the impact of budget emphasis on the 
relationship between participation and perform- 
ance. Kerlinger & Pedhazur (1973, p. 447) 
emphasized the need for replication studies, 
since replication can enhance the external valid- 
ity and generalizability of prior research if previ- 
ous findings are subsequently supported. 

In conducting. the_replication,.two specific 
issues received close consideration. Firstly, 
Brownell & Hirst suggested that cross-national 
differences in samples (U.S. versus Australia) 
could explain why the performance hypotheses 


*Thanks are duc to Peter Brownell, Graeme Harrison and Hwang Soo Chiat for their comments on earlier drafts of this paper. 
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were not supported, but they could not develop 
an argument to explain why this might be so. 
There is no evidence in the literature that would 
infer that the cultural environment in which 
manufacturing firms operate in these countries 
is different in any way. Hofstede’s (1984) results 
have shown that there is virtually no difference 
between samples from the U.S.A., Australia and 
the U.K. on his four dimensions of culture. Since 
cross-national differences were not expected to 
drive research results in these three countries, 
this study reports findings from British firms. 


Secondly, the research undertaken by Hop- ` 


wood (1972), Otley (1978), Brownell (1982) 
and Brownell & Hirst (1986) all relied on data 


drawn from single organizations..As Brownell 


commented, the generalizability of his results 
may have been hampered by not only the non- 
random selection procedure within the subject 
organization, but also relying on data from only 
one firm. Since the validity of inferences made 
from a sample to a population is dependent upon 
the sample being representative of that popula- 
tion(e.g. Hopkins & Glass, 1978, p. 183), sample 
selection requires careful consideration. In 
order to ensure the reliability of research results, 
representative samples should be employed 
(Kerlinger & Pedhazur, 1973, p. 446). A ran- 
domly drawn sample is representative of the 
population (Hopkins & Glass, 1978, p. 185). In 
order to promote result generalizability and to 
limit the possibility of inadvertently controlling 
for some firm effect(s), a number of manufactur- 
ing organizations were sampled. 

Since Brownell’s model provided the basic 
framework for this research, the hypothesis 
underlying this study was as follows. 


Ho: there is no significant interaction between budgetary 


participation and budget emphasis affecting perform- 
ance. 
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METHOD 


Production managers in each of 30 firms ran- 
domly selected from consumer product manu- 
facturing organizations located in the north of 
Britain were personally requested to complete a 
mail questionnaire which asked for responses to 
three variables: budgetary participation, mana- 
gerial performance and budget emphasis. Ques- 
tionnaires were received from 26 managers, rep- 
resenting a response rate of 87%. 


MEASURES 


The Mahoney et al. (1963, 1965) manager 
self-rating measure was used to assess mana- 
gerial performance. The measure has eight 
performance dimensions and one overall effec- 
tiveness dimension. It was found in this study 
that the eight dimensions explained 47% of the 
variance of the overall effectiveness dimension, 
a result consistent with Mahoney et ai’s find- 
ings. Data analysis was based on the overall effec- 
tiveness dimension, a procedure consistent with 
both Brownell (1982) and Brownell & Hirst 
(1986). Descriptive statistics for the measure 
are contained in Table 1: 

Budgetary participation was assessed using 
the Milani (1975) six-item measure. Cronbach 
alpha (Cronbach, 1951) for the instrument was 
0.867. Table 1 contains descriptive statistics for 
the measure. 

To assess budget emphasis, Hopwood’s 
(1972) eight-item supervisor evaluation style 
measure was employed. Evaluative style was 
determined by Hopwood by the presence or 
absence of the criteria of meeting the budget and 
concern with costs among the top chree criteria 
as ranked by respondents out of the set of eight. 


TABLE 1. Descriptive statistics for independent and dependent variables 








Standard Theoretical range Actual range 
Variable Mean deviation Min. Max. Min. Max. 
Performance 3.923 0.484 1 5 3 5 
Budgetary participation 23.423 4.149 6 30 15 30 
Budget emphasis 9.385 0.752 2 10 8 10 
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Since many of Brownell’s (1982) respondents 
failed to complete the rank ordering properly, 
Brownell (1985) used, as was the case in this 
study, a five-point Likert scale for each of the 
eight criteria and the overall score for this mea- 
sure was based on the sum of the two criteria of 
meeting the budget and concern with costs. 
Table 1 presents the descriptive statistics for this 
measure. 


RESULTS 


The following model was employed to test the 
hypothesis. 

Y= bo + BX, + bX + bX, X te 
where Y is managerial performance, X, is 
budgetary participation (difference scores from 
mean, PP) and X; is budget emphasis: —1 for 
high budget emphasis and +1 for low budget 
emphasis (dichotomised at the mean). 

Table 2 presents the results of the regression. 
Since the coefficient of the interaction term, b, 
is statistically significant, the null hypothesis of 
no interaction is rejected at p < 0.05. However, 
the rejection is not in the expected direction. 
The negative sign associated with b, suggests 
that high(low) participation together with 
high(low) budget emphasis lowers managerial 
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performance, rather than increasing it.' Hence, 
the study confirms Brownell’s result of a signifi- 
cant two-way interaction between participation 
and budget emphasis affecting performance, 
though it fails to support Brownell’s finding that 
the interaction positively affects performance.” 


CONCLUSION 


In essence, results have now been produced 
that have shown a significant positive interac- 
tion (Brownell, 1982), no.interaction (Brownell 
& Hirst, 1986) and now a significant negative 
interaction between participation and budget 
emphasis affecting performance. The finding of 
this study that performance is enhanced in con- 
ditions of high(low) participation and 
low(high) budget emphasis, indicates that alter- 
nate pairings of these variables should be av- 
oided in the design of control systems, at least in 
the population sampled. 

Recall that Brownell raised the point that the 
generalizability of his results may have been 
hampered by relying on data from only one firm. 
Sampling based on single versus multiple organi- 
zations could have been responsible for the vari- 
ance in results between this study and those of 
Brownell’s and Brownell & Hirst’s. Based on the 


TABLE 2. Regression results from testing the hypothesis 











Variable Coefficient Value Standard error t p 
Intercept bo 3.923 0.091 43.14 <0.01 
Budgetary participation b, 0.007 0.023 0.30 n.s. 
Budget emphasis by —0.006 0.091 -0.07 ns. 
Participation X budget emphasis b, —0.050 0.023 <0.05 


1s. — not significant. 
R? = 19.6%, n = 26. 


—2.19 


' The observed values for budget emphasis were restricted to 8, 9 and 10. However, random sampling provides some 
confidence that the range of scores in the population for this variable was captured. The dichotomisation treated scores of 8 
and 9 as low and 10 as high. The alternative of treating 8 as low and 9 and 10 as high produces largely identical results, in that 
the interaction is still negative and significant (t = —2.54, p <0.02). 


? Given these differences in results, an examination was conducted to determine whether or not the linear model 
proposed to fit the data did not contravene the assumptions of regression analysis (Neter et al, 1985; Hirsch & 
Louderback, 1986). It was found that the specifications of the model as presented fit the data well in this study. 
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literature that argues for the need to employ rep- case. Nevertheless, further multiorganizational 
resentative samples (Kerlinger & Pedhazur, research is required to clarify whether or not 
1973; Hopkins & Glass, 1978), this may be the this is the critical issue. 


BIBLIOGRAPHY 


Brownell, P., The Role of Accounting Data in Performance Evaluation, Budgetary Participation, and 
Organizational Effectiveness, Journal of Accounting Research (Spring 1982) pp. 12—27. 

Brownell, P., Budgetary Systems and the Control of Functionally Differentiated Organizational Activites, 

Journal of Accounting Research (Autumn 1985) pp. 502-512. 

Brownell, P. & Hirst, M., Reliance on Accounting. Information, Budgetary Participation, amd Task 
Uncertainty: Tests of a Three-way Interaction, Journal of Accounting Research (Autumn 1986) fp. 241— 
249. 

Cronbach, L.J., Coefficient Alpha and the Internal Structure of Tests, Psychometrika (1951) pp. 297-334. 

Hirsch, M. L. & Louderback HI, J. G., Cost Accounting: Accumulation, Analysis, and Use (Bostcn: Kent 
Publishing Co., 1986). ‘ 

Hofstede, G., Culture’s Consequences (Beverly Hills: Sage Publications, 1984). 

Hopkins, K. D. & Glass, G. V., Basic Statistics for the Bebavioral Sciences (Englewood Cliffs: Prentice-Hall, 
1978). 

Hopwood, A. G., An Empirical Study of the Role of Accounting Data in Performance Evaluation, Journal of 
Accounting Research (Supplement 1972) pp. 156-182. 

Kerlinger, F. N. & Pedhazur, E. J., Multiple Regression in Bebavioral Research (New York: Holt, Einehart 
and Winston, 1973). 

Mahoney, T. A., Jerdee, T. H. & Carroll, S. J., Development of Managerial Performance: A Research 
Approach (Cincinnati: South-Western Publishing Co, 1963). 

Mahoney, T. A., Jerdee, T. H. & Carroll, S. J., The Jobs of Management, Industrial Relations (February 1965) 
pp. 97-110. 

Milani, K. W., The Relationship of Participation in Budget-setting to Industrial Supervisor Performance and 
Attitudes: A Field Study, The Accounting Review (April 1975) pp. 274-284. $ 
Neter, J., Wasserman, W. & Kutner, M. H., Applied Linear Statistical Models, 2nd ‘edn (Homewood, IL: 

Irwin, 1985} a ; 

Otley, D. T., Budget Use and Managerial Performance, Journal of Accounting Research (Spring 1278) pp. 

122-149. 


GEC òl 
Ac 2¢ 
Alh- 


Accounting, Organizations and Society, Vol. 14, No. 4, pp. 325-335, 1989. 


Printed in Great Britain 


0361—3682/89 $3,00+.00 
Pergamon Press pic 


THE ROLE OF BUDGET DATA IN THE EVALUATION OF 
MANAGERIAL PERFORMANCE* 
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Abstract 


This study was designed to test two of the contingency frameworks suggested by Otley (Journal of 
Accounting Research, 1978, pp. 122—149) as a possible explanation for the differences in results between 
his study and that of Hopwood (Supplement to Journal of Accounting Research, 1972, pp. 156-193). 
Using three manufacturing organizations in Northeastern U.S.A, it was found that the two contingency 
frameworks may not provide adequate explanation for the differences. 


The role of budget data in the evaluation of man- 
agerial performance and the determination of 
rewards in organizations has been emphasized in 
the accounting literature. Researchers have for- 
mulated and/or tested hypotheses concerning 
the consequences of the different ways or 
“styles” that budget data may be used in perform- 
ance evaluation and reward (Hopwood, 1972; 
Otley, 1978; Brownell, 1982). At best, the 
research results are inconsistent and inconclu- 
sive, however. For example, Hopwood (1972), 
using an American firm, found that managers 
evaluated on a style which used budget data in a 
rigid manner in performance evaluation re- 
ported significantly higher levels of job tension 
than managers evaluated on their contribution 
to organizational effectiveness. Otley (1978) 
replicated Hopwood’s study in a British firm. He 
found no significant differences in the levels of 
job tension and performance reported by mana- 


gers evaluated on the styles initially used by 
Hopwood. Faced with such differences, other 
researchers have formulated contingency frame- 
works where the effectiveness of using budget 
data in performance evaluation and reward is 
hypothesized to depend on the context in which 
the styles are used (Hayes, 1977; Otley, 1980; 
« Birnberg et al., 1983). 

Otley (1978, 1980) suggested a style—context 
framework to reconcile the differences in results 
between his study and that of Hopwood (1972). 
The style dimension consisted of a high and a 
low emphasis on budget data in performance 
evaluation. The context dimension consisted of 
managerial interdependency’ (high and low) 
and task uncertainty? (also high and low). 
Otley’s framework suggested’ that when mana- 
gers face high levels of interdependency or un- 

` certainty, a high emphasis on budget data in per- 
formance evaluation may be dysfunctional for 


*The author wishes to extend his appreciation to his dissertation committee members, Jake Birnberg (Chairman), Barry 
Lewis, Craig Russell, Larry Feick and Evelyn Perloff for their support throughout the duration of this research. The author's 
appreciation also goes to Neil Wilner, J. Harry Evans, Il, Jeff Pinto, Stan Biggs, Ken Merchant and two anonymous referees 
for their comments and suggestions on an earlier draft of this paper. 


‘Managerial interdependency is the extent to which each manager perceives his or her work-related activities to require the 
joint or cooperative effort of other managers within that organization (Osborne et al., 1980). 


?Task uncertainty is the extent to which managers can predict confidently the factors which have effect on their work-related 


activities (Duncan, 1972; Otley, 1978). 
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managerial performance. A high emphasis on 
budget is more effective than a low emphasis in 
contexts characterized by low levels of task 
interdependency or uncertainty. However, 
although suggestive, such a contingency formu- 
lation has not been subjected to rigorous empir- 
ical analysis. 

The current research was designed to test the 
two contingency models formulated by Otley 
(1980). The study tested the effectiveness of 
using budget data in performance evaluation and , 
reward in task contexts characterized by high or 
low levels of interdependency or uncertainty. 

Based on the above frameworks, the rest of the 
paper is organized in the following manner: the 
next section provides a review of the literature 
and a specification of the hypotheses tested. The 

‘research method and results are presented 
thereafter. This is followed by discussion and 
conclusion. 


LITERATURE REVIEW AND HYPOTHESES 


Previous research in accounting has identified 
some of the consequences of using budget data 
in performance evaluation. The arguments and 
findings suggest that when a high emphasis is put 
on using budget data for performance evaluation 
and reward, managers may report job tension 
and behave in ways which are dysfunctional for 
organizational goals. However, the arguments 
and conclusions were based on inconsistent and 
inconclusive empirical-results or case evidence 
(Argyris, 1952; Simon et al., 1954; Hofstede, 
1967, Lawler, 1976). 

Hopwood (1972) was the first researcher to 
systematically test for the consequences of the 
different styles of using budget data in perform- 
ance evaluation. The three styles he used in his 
empirical test were: (1) a budget constrained 
style (BCS) where budget data was used in a 
rigid manner in performance evaluation; (2) a 
profit conscious style (PCS) where the emphasis 
in performance evaluation was on contribution 
to organizational profitability; and (3) a non- 
accounting style (NAS) where budget data 
played a relatively unimportant role in perform- 
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ance evaluation. Hopwood tested for the conse- 
quences of these styles and found the BCS, com- 
pared to PCS or NAS led to greater stress, more 
hostility toward supervisors and peers, a greater 
tendency to blame other managers for poor per- 
formance outcomes and an increased tendency 
to charge departmental costs to other depart- 
ments. He also reported that BCS managers were 
more likely than PCS or NAS ones to falsify their 
budget data and were less likely to meet their 
budgets. Based on the differential effect of the 
three styles on stress, Hopwood suggested that 
BCS would have a more negative impact on man- 
agerial performance than PCS or NAS. 

- Otley (1978) replicated Hopwood’s study ina 
British firm that was well suited to the use of PCS 
in performance evaluation. He did not confirm 
Hopwood’s initial results. There were no signifi- 
cant differences in the levels of performance and, 
stress reported by managers evaluated on BCS 
and those evaluated on PCS. Otley’s BCS mana- 
gers were more likely to meet their budgets than 
his PCS managers, a result that was opposite to 
that of Hopwood. Three explanations were 
offered for the differences in results. First, 
Otley’s managers were said to operate more in- 


‘dependently of other units within the same or- 


ganization than Hopwood’s managers. Secondly, 
Otley’s managers were profit center managers 
whereas Hopwood’s subjects were cost center 
managers. Finally, Hopwood’s managers were 
said to operate in a less predictable environment 
than Otley’s managers. 

The preceding review suggests some critical 
questions to ask: why did Hopwood’s BCS mana- 
gers report more stress than his PCS managers 
whereas Otley found no significant differences 
in the level of stress reported by his BCS and PCS 
managers? If one style of evaluating performance 
leads to more stress, does that style also lead to 
lower levels of managerial performance? Under 
what conditions do the different performance 
evaluation styles lead the managers to report 
favorable attitudes towards their budgets? 

These questions are interesting to ask because 
organizations may show an interest in the differ- 
ences in the performance evaluation styles if it 
can be consistently demonstrated that one style. 
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enhances performance more than another. 
Secondly, the effects of the different styles of 
evaluating managerial performance on attitude 
towards the budget may be interesting to study 
since it has been suggested that managerial 
attitudes towards the budget may influence how 
managers use budget data in decision-making 
and problem-solving (Hirnberg et al., 1983). 
The more favorable the attitude, the more confi- 
dence managers may have in using budget data 
in decision-making. 

The effects of the interaction between budget 
evaluation style and task interdependency or un- 
certainty on stress, performance and managerial 
attitude have been given limited attention in 
empirical research in accounting: Only one 
study has investigated the appropriateness of 
using accounting data in performance evalua- 
tion in task contexts characterized by different 
levels of interdependency (Hayes, 1977). In that 
study, interdependency was not measured em- 
pirically but was assumed. Similarly, few re- 
search studies have addressed the effectiveness 
of accounting data in performance evaluation 
when task uncertainty is controlled (Rockness & 
Shields, 1984; Gordon & Narayanan, 1984). In 
all of these studies, no consistent result has 
emerged. 

It is argued in this study that because tasks in- 
volving interdependency or uncertainty may be 
subject to evaluation using budget data, mana- 
gers may be concerned with how such budget 
data are used for their evaluation and reward. In 
tasks characterized by high uncertainty or inter- 
dependency, managers may perceive them- 
selves as having less than full control over per- 
formance outcomes. Using budget data in a rigid 
manner in such situations may be dysfunctional 
for performance since a rigid use of budget data 
assumes that most of the factors which have an 
effect on task outcomes are within the control of 
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the managers being evaluated (Ansari, 1979). 
On the other hand, a rigid use of budget data may 
be more acceptable to managers if they perceive 
themselves able to exercise control over their 
performance outcomes. This would be the case 
in tasks characterized by low uncertainty or 
interdependency. In other words, it is not 
budget style per se which may lead to higher 
stress or lower performance. Rather, it is the mis- 
match of budget style and task contexts which 
may enhance stress or reduce managerial per- 
formance. The arguments presented above 
formed the basis for the following hypothesis: 


There will be significant interactions between: budget 
style of performance evaluation and task interdepen- 
dency on the one hand and between budget style and task 
uncertainty on the other, on job stress, performance and 
attitude toward the budget, Le. when tasks are charac- 
terized as high on interdependency or uncertainty a BCS 
used in performance evaluation will lead to greater 
stress, lower performance and less favorable attitude 
toward the budget than the use of 2 PCS. On the other 
hand, when tasks are low on interdependency or uncer- 
tainty, a BCS used in performance evaluation will lead to 
higher performance, lower stress and a more favorable at- 
titude toward the budget than the use of a PCS. 


RESEARCH METHOD 


The research method was a field study using a 
questionnaire survey. Three organizations par- 
ticipated in the study. The organizations in- 
cluded a tool manufacturing company with a 
sales volume of $460 million and an employee 
size of about 4000 in 1983, an equipment divi- 
sion of a manufacturing company with sales of 
$1 billion and an employee size of 9700 in 1983 
and a hospital manufacturing company with 
sales of $300 million and an employee size of 
3400 in 1983.° 

Participants in the study were cost center 
managers‘ with budget responsibilities ranging 


>The first two companies had reduced their employee size by almost 40% between 1982 and 1983. Both had reported 
financial losses for 8 consecutive quarters prior to the time the research was started. The third company had reported 
declining profits for 4 consecutive quarters prior to the time of the study. 


‘although attempts were made to include profit center managers in the study, two of the participating organizations would 
only allow their cost center managers to participate in the study. Since the third company did not have enough profit center 
managers to act as a control sample, it was decided to restrict the study to cost center managers. The bias this choice of 
managers could possibly have on the results of this study should be noted when interpreting the results. 
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from $200,000 to $1.5 million a year. The par- 
ticipants had been in their job positions for an 
average of 5 years. Each had also worked for the 
organization they were employed in for an aver- 
age of 13 years. 


Research procedures 

The participants were identified by the 
researcher and the coordinator in each organiza- 
tion using the organization chart.’ The criteria 
used in selecting the participants included: (1) 
each participant should have budget responsibil- 
ity; (2) each person must have been in that par- 
ticular position for at least 2 years; (3) two par- 
ticipaùts with the same supervisor should not 
participate in the study simultaneously; and (4) 
no manager who had participated in the pilot 


test phase should participate in completing the. 
questionnaire. The research was undertaken in 


three phases. 


Phase 1. An unstructured interview was 
undertaken in two of the participating organiza- 
tions. Managers with budget responsibilities 
were used for these interviews. The purpose of 
this phase was 4fold: (1) to gain an insight into 
the organization’s budgeting process; (2) to 
determine the appropriate level of management 
to participate in the study; (3) to identify how 
the budget styles used in the research man- 
ifested themselves in the organizations which 
participated in the study; and (4) to identify 
whether there were other important issues or 
problems faced by the participants besides the 
ones studied. : 


Phase 2. Some of the measures used in the 
study were pretested in two of the three organi- 
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zations which participated. The pretest subjects 
were encouraged to put comments on any items 
which were not clear to them. Based on the pre- ` 
test, some items were reworded, others were 
eliminated, while a few were added. In general, 
the interpretation of most of the items by the 
participants was consistent with the way the 
researcher wanted them interpreted. 


Phase 3. One hundred packages were sent to 
each of the three organizations to distribute to 
participating managers. Each package contained 
two questionnaires (one for the participant and 
the second for the supervisor of the participant), 
two prepaid postal envelopes to be used to 
return the questionnaires directly to the re- 
searcher and two letters. One letter was from the 
researcher explaining the purpose of the re- 
search and soliciting their participation. The 
second letter was from a Vice-President of the 
participating company explaining that the ques- 
tionnaires would be used for research purposes 
only. They were informed in the Vice-Presi- 
dent’s letter than participation was voluntary. 
The questionnaire itself contained statements 
assuring the participants that their responses 
would be treated with absolute confidentiality. 
Each package was sent to the supervisor or boss 
who was expected to complete his or her part 
and give the second part to the subordinate so, 
named by the Vice-President in his memo. 

Table 1 contains the responses of managers 
from each participating organization. Although 
300 questionnaires were given to the coor- 
dinators to distribute among their managerial 
employees deemed appropriate for the study, a 
total of 188 questionnaires were actually given: 
out in the three organizations.’ 


‘The coordinators were appointed by each company on the basis of their familiarity with the budgeting process in that 


particular organization. 


The third company would not agree for an extensive interview to be conducted. However, interviews were held with some 
of the budgeting staff and two Vice-Presidents to confirm that budget was used in that company in a manner deemed 
appropriate for this study. The company was included in the study when it was found that their budgeting procedures were 


consistent with prior expectation. 


*The coordinators only gave the questionnaires to those managers and their supervisors who agreed to participate in the 
study when they were contacted on the telephone. Since the coordinators did not keep any record of those managers who 
refused participation, it was not possible to find out how much non-participation would bias the results of the study. 
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TABLE 1. Responses by organization 








Organizationl Organization2 Organization3 Total 
Total questionnaire* 100 100 100 300 
Total given out 45 76 -67 188 
Number of responses 27 52 41 120 
Response ratet 60% 68% 61% 64% 
Number usable 24 41 37 102 
Usable response ratet 53% 54% 58% 54% 





*Number delivered to the coordinators in each organization. 


TAs a percentage of questionnaires given out. 


Of the number distributed, 120 were 
returned? giving a response rate of 64%. Eigh- 
teen questionnaires were dropped from the 
analysis either because all parts of the question- 
naires were not completed or because the re- 
spondents confirmed in one statement that they 
did not have budget responsibilities. One 
hundred and two questionnaires representing 
54% of the total given out were used for analysis. 


Measurement and validation of variables 
Task interdependency. Task interdependency 
was defined as the extent to which each particip- 
ant perceives his or her work-related activities 
to require the joint or cooperative efforts of 
other work-groups within that particular organi- 
zation. A 2-item, 5-point, Likert scale from 
Osborne et al. (1980) was used in the study. 


Task uncertainty. Task uncertainty was 
defined as frequency of change of factors affect- 
ing work-unit performance (Duncan, 1972). A 5- 
item, 5-point, Likert scale from Podsakoff et al. 
(1983) was used to measure the variable. 


Job stress. Stress was defined as the extent to 
which an individual faces situations charac- 
terized by incompatible demands on his or her 
role performance or when two or more indi- 





viduals make incompatible or inconsistent 
demands on the focal individual (Rizzo et al., 
1970). A 7-item, 7-point, Likert scale from Rizzo 
et al. (1970) was used for the study. The instru- 
ment used for this research was different from 
that used by Hopwood (1972) and Otley 
(1978). 


Performance. Seven dimensions of managerial 
performance used for empirical analysis by Por- 
ter & Lawler (1968) and Van de Ven & Ferry 
(1980) were used for this research. The dimen- 
sions include quality of work, attitude towards 
work, effort, speed of work, frequency of useful 
ideas suggested, frequency of meeting work 
goals and cost efficiency of the work unit of 
which the particular manager is head. These 
dimensions were chosen because they were in- 
cluded in the performance appraisal forms used 
by the three companies for evaluating the per- 
formance of managerial employees. The super- 
visors of the participating managers were 
required to rate the participants on a 7-point 
Likert scale ranging from “well below average” 
to “well above average” on the 7 dimensions tak- 
ing into consideration all subordinates he or she 
had supervised over the previous 3 years. 
` Attitude towards the budget. Respondents 
were required to indicate how favourable or un- 


*sixty-eight manager/supervisor pairs who had initially agreed to participate in the study did not return their questionnaires. 
This number includes managers in two departments where the Vice-Presidents who had given prior approval to their 
departmental participation in the study later changed their minds. These two VPs could not be contacted to know why they 
vetoed their departmental participation. The other managers who did not return their questionnaires likewise could not be 
contacted because the coordinators were precluded by their organizations from allowing the researcher to deal directly with 


participating managers. 
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favourable their attitudes were towards the 
budgeting staffs. The scale was measured using a 
2-item, 7-point, Likert scale made for the study. 


Budget evaluation styles. The styles initially 
used by Hopwood (1972) and Otley (1978) 
were used for this study. They included (1) 
budget constrained style (BCS), (2) profit con- 
scious style (PCS)? and (3) non-accounting style 
(NAS). 10 

Eight items developed by Hopwood (1972) 
were used'' for data analysis. Participating mana- 
gers were required to indicate on a 7-point 
Likert scale the extent to which they perceived 
their supervisors to use the 8 items in their per- 
formance evaluation. Each manager was as- 
signed to the style on which he or she had the 
highest average score. Fifty-eight managers were 
classified as being evaluated on BCS,!? 40 on PCS 
and 4 on NAS. The 4 managers who reported that 
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they were evaluated on NAS were dropped from 
the analysis because this group were not suffi- 
ciently numerous for the empirical tests. 

The descriptive statistics and internal reliabil- 
ity of the variables used in the analysis are sum- 
marized in Table 2. Cronbach coefficient alpha 
(Cronbach, 1951) was used to determine inter- 
nal reliability of each multi-item variable. 


RESULTS 


The following equation was tested using 
multi-variate linear regression (SPS5x, 1986, pp. 
527-533): 

Yı = Oy + byZ + byX, + byXz + byXıZ + 
By X2Z + byX X2 + byXX2Z + €y 
where 
J= 1,2,3, 
Y, = job stress, 


TABLE 2. Descriptive statistics of variables used in empirical analysis 





No.of Theoretical Actual 


Standard Cronbach’s 





Items* range range Mean deviation alpha 
1. Taskinterdependency 2 2-10 2-10 7.3 18 0.77 
2. Task uncertainty 5 5-25 9-25 16.0 3.3 0.79 
3. Job stress 7 7-49 14-47 28.1 6.3 0:73 
4. Performance 7 7—49 22—42 32.7 5.0 0.94 
5. Attitude towards the budget 2 2-14 2-14 90 2.4 086 





*No. of items used for empirical analysis. 





In the two organizations in which the interviews were carried out, PCS manifested itself in the form of how cost effective 
the respective managers were in their operations, Hopwood (1972) also reported this concern in his study. 


although budget data may not be considered as important in performance evaluation, the style does not preclude the use 
of budget information for other purposes such as problem-solving. This study only focused attention on one type of use to 
which budget data may be put, Le. in evaluating performance and determining rewards in organizations. 


"These 8 items were included in a list of 21 items given to the managers and their supervisors to respond to. The 21 items 
also included 13 others generated during the interview with same of the participating managers. All of the 21 kems combined 
represent what the managers considered as most important in their performance evaluation. The list was also checked with 
some supervisors and human resources aa a E lea 
the companies in performance evaluation. 


"Both Hopwood (1972) and Otley (1978) asked participating managers in their studies to rank-order the items they used 
in measuring their budget styles. The ranks were used to derive the three styles they used in empirical analysis. This research 
used a simple average of the raw scores to determine the styles used in empirical analysis. The differences in the way style 
was measured between this study, on the one hand, and Hopwood (1972) and Otley (1978) on the other, should be noted 
in interpreting the results reported in this study. It is possible that differences in measurement may have influenced the 
results reported in this study. 


ROLE OF BUDGET DATA 


= performance, 
Y, = attitude towards the budget, 
X, = task interdependency, 
X, = task uncertainty, and 
Z = budget style (code 1 for BSC, 0 for 
PCS). 

Multi-variate linear regression is appropriate 
to use when there are two or more dependent 
variables being regressed on one or more inde- 
pendent variables simultaneously. In the test 
carried out, performance, job stress and attitude 
towards the budget were used as dependent var- 
iables. The independent variables included 
budget style (coded 1 for BCS and 0 for PCS), 
task interdependency, task uncertainty as well as 
interaction terms between budget style and task 
interdependency, budget style and task uncer- 
tainty, task uncertainty and task interdepen- 
dency and budget style, task interdependency 
and task uncertainty. 

The results of the test of the hypotheses are 
reported in Table 3. The regression coefficients 
representing a test of hypotheses are presented 
by 84,and 8&., in Table 3. As can be observed from 
the table, none of the six regression coefficients 
tested was significant at the 5% alpha level. This 
means that none of the interaction terms of 
budget style and task uncertainty as well as 
budget style and task interdependency on per- 
formance, job stress and attitude towards the 
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budget was significant. Managers who reported 
that they were evaluated on BCS and who oper- 
ated in task contexts characterized as high or 
low on interdependency or uncertainty did not 
report higher performance, job stress or more 
favorable attitudes toward the budget than PCS 
managers. 

Also from Table 3, it can be seen that the 
regression coefficients from budget style to job 
stress and from budget style to attitude towards _ 
the budget were significant at an alpha level of- 
5%. To test for the direction of significance, a t- 
test was carried out with job stress and attitude 
towards the budget as the dependent variables 
and budget style (coded 1 for BCS and 0 for PCS) - 
as the independent variable. It was found that 
BCS managers reported higher job stress than 
PCS managers. This result is consistent with that 
reported by Hopwood (1972) as well as the 
related conceptual literature (Hirst, 1981). BCS 
managers also reported a more favorable 
attitude towards the budget than PCS managers. 
This finding is similar to that of Otley (1978) 
who reported that BCS managers were more 
likely to meet their budgets than PCS managers. 


DISCUSSION 


The study focused on the. effects of budget 


TABLE 3. MRLA test of interaction effect of budget style and task interdependency or uncertainty on job 
stress, performance and attitude towards the budget 








Dependent variabies 
Job Attitude towards 
stress Performance the budget 

Direct effects of: 

Budget style (BS): 8, 1.67ł ns —1.24+ 

Task interdependency (TI): 8y ns ns ns 

Task uncertainty (TU): 8, ns ns ns 
Interaction effects: 

BS X TE By ns ns ns 

BS* TU: 84, ns ns ns 

TI” TU: 8g ns ns ns 

BS X TI x TU:8, ns ns ns 
Uni-vartate F 5.22§ ns 1.85° 





*pS0.10; tp<0.05; $p<0.01; §p<0.001. 
ns: not significant. 
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style of evaluating managerial performance on 
job stress, attitude towards the budget and per- 
formance in task contexts characterized as high 
-or low on interdependency or uncertainty. As 
reported in the previous section, the hypotheses 
tested were not supported. In other words, the 
contingency frameworks suggested by Otley 
(1978) may not provide adequate explanation 
for the differences in results between Hop- 
‘wood’s study and that of Otley. 


There are two other possible explanations for 


the differences in results between the two 
studies, however. These are the financial condi- 
tion of the organization and reward uncertainty. 


Financtal condition of organization 
As explained earlier, all three organizations 
` which provided managers for this study had 


experienced financial losses or declining profita- , 


bility for 4—8 consecutive quarters prior to the 
time this research was conducted. Perhaps when 
organizations are facing financial hardship, the 
differential effects of performance evaluation 
styles on work outcomes may be difficult to iso- 
late empirically. It is likely that when companies 
are undergoing periods of declining profitabil- 
ity, there might be so much emphasis on 
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financial data in performance evaluation that it 
may be difficult to truly classify managers into 
different styles on the basis of how much consid- 
eration is given to budget data in performance 
evaluation." It is possible that the styles used for 
this research may only be found in organizations 
which have no severe pressures for financial per- 
formance. 

That financial difficulties may explain failure 
to provide support for the hypotheses tested in 
this research is further illustrated by Table 4 
which provides a comparison of the distribution 
of NAS in this study with those in Hopwood 
(1972), Otley (1978) and Brownell (1982). In 
the studies of Hopwood and Brownell where the 
hypotheses tested were supported, 44% and 
31%, respectively, of the managers who partici- 
pated in the studies were classified as being 


` evaluated on NAS. On the other hand, in this cur- 


rent study and that of Otley where the hypoth- 
eses tested were not supported, the percentages 
of managers who reported they were evaluated 
on NAS were 4% and 3%, respectively.'4 
Perhaps when organizations are passing 
through financially difficult times, there are no 
managers with budget responsibilities who are 
not evaluated on the basis of budget data. There 


TABLE 4. Comparison of NAS in this study with Hopwood, Otley and Brownell 











Hopwood (1972) 167 
Otley (1978) 39 
Brownell (1982) 38 
This study 102 


Based styles? NAS Hypotheses 
56% 44% Supported 
97% 3% Notsupported 
69% 31% Supported 
96% 4% Notsupported 








*Budget based styles include Hopwood’s BCS and PCS. 


Otley (1978) made a similar_argument that the style used in performance evaluation may be due in part to economic 
circumstances. According to Otley, a profit center experiencing financial difficulty was more likely to formulate optimistic 
budget standards than another profit center experiencing reasonable profitability. This is because when a profit center is 
reporting declining profitability, the manager was more likely to be evaluated either on multiple criteria or in a flexible 
manner than when he or she was experiencing profitability. The explanation offered by this research that the financial 
condition of the organization as a whole may explain differences in results between the studies of Hopwood and Otley in fact 
supports Otley’s’suggestion that the economic situation of a profit center may determine how budget data are used in 


performance evaluation. 


"Professor David Otley, in a private communication with this researcher confirmed that the organization he used in his study 
was actually experiencing financial difficulty at the time he conducted his research. 


ROLE OF BUDGET DATA 


is also the possibility that during periods of poor 
financial performance, there might be so much 
emphasis on budget information and reports on 
the interactions between a manager and his or 
her supervisor that most managers may have a 
tendency to think they are evaluated on the basis 
of budget data even though that might not be the 
case in circumstances where the company was 
profitable. 

That the financial status of the organizations 
used for analysis may have influenced the results 
of this study is further supported by the follow- 
ing quotation from an interview conducted in 
phase 2 of the research: 


You are told to prepare your budget. . . they tell you to be 
daring, innovative and creative ... I spend weeks prepar- 
ing my plans. When they send you your approved budget, 
they cut under your feet. 


Another R & D manager also complained that: 


For example, based on your technical plan,'* you budget 
for an equipment of $50,000. When it gets there,'® they 
order 20% across the board cut. Meanwhile, you cannot 
buy 80% ofan equipment. Either you buy it or you don’t. 
Eventually, we end up with unworkable budget and tech- 
nical plans. 


It is likely that organizations which face 
financial pressures run their budget processes as 
described in the statements quoted above. It is 
probable that across the board cuts in all areas of 
operations are found in organizations experienc- 
ing financial difficulties. 


Reward uncertainty 
A second reason the hypotheses tested in this 
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study have not been supported may be a result of 
the managers used for analysis not being aware 
of how they were evaluated by their supervisors. 
In this study, 21 performance evaluation 
criteria" were given to both managers and their 
supervisors to complete. The managers were 
required to indicate to what extent they thought 
their supervisors used the 21 items in their per- 
formance evaluation. The supervisors were also 
required to indicate the extent to which they 
actually used the 21 items in evaluating the man- , 
ager jointly participating in the research with 
them. A Pearson correlation was run between 
both sets of responses. The correlations 
between both sets of responses for the overall 
sample, the BCS and PCS subsamples are re- 
ported in Table 5. Overall, 8 items were positive 
and significant, 3 were negative and significant 
and 11 were not significant. The comparative fig- 
ures for BCS and PCS managers are summarized 
in the table.'® 

When budget style was defined on the basis of 
the responses of the superiors, the superiors 
reported that they put less emphasis on budget 
data in performance evaluation than their subor- 
dinates reported. For example, 26 out of 102 
superiors reported that they evaluated their sub- 
ordinates on BCS, while 31 and 45 reported PCS 
and NAS, respectively. The corresponding fig- 
ures for the subordinates were 58, 40 and 4 for 
BCS, PCS and NAS, respectively. . 

That reward uncertainty may also be a pos- 
sible explanation for failure to support the 
hypotheses tested in this research is further sup- 
ported by the following statement made by one 
of the managers who participated in an inter- 
view phase of this research:'® 


These are the research and product development plans and objectives of R & D managers. 


‘6This refers to the top management levels where the final budget decisions are made. 


These 21 items include 8 items from Hopwood (1972). The additional items were generated from the Interview stage. 


"Hopwood (1972) also argued that the style a manager perceives himself to be evaluated on may be different from that 
actually used by his supervisor in performance evaluation. The result reported here is, in fact, support for Hopwood’s 
argument that there may be disagreement between a manager and his supervisor on the criteria used in performance 
evaluation. 


'*The statement quoted here is typical of the complaints many of the managers who participated tn the interview phase made... 
` 
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TABLE 5. Distribution of correlation coefficients between the responses of managers and their supervisors 
on criteria used in evaluating managerial performance 








Overall 

sample BCS PCS 
Significant and positive correlation 
coefficients 8 6 5 
Significant and negative correlation 
coefficients 1 4 2 
Not significant correlation coefficient 12 11 14 
Total 21 21 21 





Information in this organization is top down ... You do 
not receive information as a matter of routine here. You 
receive them because you fight for them ... they only 
provide you with information when they want to beat up 
on you. 


Many of the managers complained that their 
organizations did not have formalized processes 
~for informing them what they had to do for them 
to be successful performers and, hence, re- 
warded accordingly. 


CONCLUSION 


This research sought to test the two contin- 


gency frameworks suggested by Otley (1978). 
The hypotheses tested were not supported 
suggesting that Otley’s frameworks may not 
adequately explain the differences in results 
between his study and that of Hopwood. 

Two reasons were offered as possible explana- 
tions for failure to support the hypotheses 
tested. One was the financial condition of the 
organizations used in the analysis. The three 
organizations which provided managerial 
employees for this study had experienced 4—8 
quarters of financial losses or declining profita- 
bility prior to the time this current research was 
conducted. Secondly, the managers used in 
analysis may not have been aware of how their 
performances were evaluated and rewarded. 
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Abstract 


Despite the common use of consensus amongst experts as a “surrogate for truth” in experimental studies 
of decision-making, there has been little discussion as to its appropriateness. A new measure of consensus 
for dichotomous predictions, which takes into account experts’ confidence in the correctness of their 
decisions, is proposed and its relationship with accuracy is experimentally evaluated. The experiment also 
extends Ashton’s (The Accounting Review, April 1985, pp. 173—185) empirical work on the conventional 
consensus measure by checking the robustness of Ashton’s results to a different experimental setting (UK. 
trade credit specialists) and to increases in the cues made available to experimental participants. The paper 
concludes that the new measure of consensus complements the conventional measure, in that it provides 
information over and above that provided by the conventional consensus measure. The paper also 
concludes that the results noted by Ashton for the US. also hold for the U.K. and are robust to increases in 


the cue set made available to participants. 


Much of the experimental research into various 
aspects of individual decision-making has had a 
practical aim, namely, the making of “better” 
decisions. Irrespective of how “better” is de- 
fined in any particular context, some form of 
“benchmark” or criterion to determine whether 
or not one decision is superior to another is re- 
quired. Ideally, some “objective” and measur- 
able external criterion such as accuracy or “cor- 
rectness” should be used. However, in situations 
where no objective external criteria exist, as in 
many evaluative and assessment type account- 
ing and auditing decision situations, this is obvi- 
ously infeasible. In these situations, recourse is 
usually made to the level of consensus between 
experts, where consensus is defined as the num- 
ber of agreements between experts. 

The purpose of this paper is to consider the 
appropriateness of consensus, as defined above, 
given the role it is assigned in experimental 


studies. After considering the role of consensus 
in experimental studies, a new measure of con- 
sensus is proposed. Both the conventional and 
mew consensus measures are then experimen- 
tally evaluated using Ashton’s (1985) procedure 
of comparing their relative correlations with ac- 
curacy. The paper also extends Ashton’s (1985) 
empirical evaluation of the conventional con- 
sensus measure in two ways: first, by determin- 
ing the robustness of her results in a different ex- 
perimental setting, and second, by assessing the 
impact of increases in the cue set made available 


_.to experimental participants. 


The paper contains the following sections. 
First, the role of consensus in experimental situ- 
ations is considered and a new measure pro- 
posed. Second, the nature of the experiment 
used for the evaluation of the new measure is 
briefly described. The third section presents the 
empirical findings for both pairwise and indi- 
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vidual definitions of consensus and accuracy. A 
- discussion of the results and conclusions are 
given in the final section. 


“i 


CONSENSUS IN EXPERIMENTAL STUDIES 


Measures of consensus 

The ultimate purpose of much experimental 
work on decision-making in accounting is to 
understand and improve decision-making in 
practice. As a consequence, the correctness of 
decisions made in experiments needs to be 
evaluated. In some decision-making situations 
there is an objective measure, such.as accuracy, 
against which to compare the decisions made by 
participants in an experiment. For example, in 
experimental studies concerned with failure 
prediction, researchers are able to judge the cor- 
rectness of decisions by their knowledge of 
which companies actually failed. However, a 
number of decision situations in accounting are 
evaluative in nature (for example, internal con- 
trol evaluation) and, therefore, recourse has to 
be made to a different measure of the quality of 
decisions. In these types of situations the level of 
consensus between experts is the usual measure 
of decision quality, 


“Because we lack an objective definition of a ‘correct’ 
decision in situations such as materiality judgement, con- 
sensus judgements of experts are often employed as a 
substitute criterion .... In other situations such as inter- 
nal control evaluation, the degree of judgemental con- 
sensus is often used as a substitute measure of decision 
quality; lack of consensus indicating that at least some in- 
dividual’s judgements are incorrect” (Libby & Lewis, 
1982, p. 234). 


However, this conventional measure of con- 
sensus may not provide a good proxy for accu- 
racy in experimental situations, particularly 
where dichotomous predictions are required. 
This is because consensus merely records the 
amount of agreement across experts and does 
not differentiate between agreement over cor- 
rect decisions and agreement over incorrect de- 
cisions. Thus, high levels of consensus need not 
imply “better” or more “correct” decisions since 
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all subjects could be equally wrong. Whether or 
not consensus is an adequate surrogate for truth 
is, therefore, an entirely empirical- matter. 

Moreover, the conventional consensus mea- 
sure appears to incorporate an excessively re- 
strictive view of expertise. Arguably, a richer 
view of expertise would be where experts not 
only make correct decisions but are also reason- 
ably able to judge ex-ante the correctness of 
their decisions. A consensus measure that in- 
cludes experts’ confidences in their decisions 
may, therefore, provide an improved proxy for 
accuracy. Before considering whether this is the 
case empirically, consideration will be given to 
the circumstances where confidence and accu- 
racy might be well matched. f 

First, experimental results (for example, 
Clarke, 1960; Nickerson & McGoldrick, 1965; 
Pitz, 1974; Lichtenstein & Fischhoff, 1977, 
1980a) suggest that the more difficult the ex- 
perimental task, the more likely a subject’s confi- 
dence is to outweigh his/her accuracy. In other 
words the general finding in experimental 
studies that subjects tend to be over-confident 
(for example, Oskamp, 1965; Hazard & Peter- 
son, 1973; Koriat et al., 1980) weakens as the 
task presented to subjects becomes easier. This 
“difficulty” effect has been explained by 
Lichtenstein et al. (1982) as arising from sub- 
jects’ inability to assess the relative difficulty of 
the task facing them. 

A second factor affecting the relationship be- 
tween confidence and accuracy is the amount of 
feedback given to the subjects regarding their 
performance. Research by Adams & Adams 
(1958, 1961) and Lichtenstein & Fischhoff 
(1980a) found that extensive feedkack concern- 
ing subjects’ prior performance improved the 
subjects’ subsequent performance. In contrast, 
Choo (1976) found that a single feedback ses- 
sion did little to improve performance. Einhorn 
(1982) argues, however, 


“outcome information, without knowledge of structure, 
can be irrelevant for providing self-correcting feedback 
about poor heuristics. It has alsó been argued that knowl- 
edge of task structure is difficult to achie~e because of the 
inductive way in which we learn from experience” (p. 
282). 
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How feedback affects the relationship be- 
tween confidence and accuracy is, therefore, de- 
pendent upon the amount of feedback provided 
to subjects and the subjects’ knowledge of the 
structure of the task facing them. In connection 
with this last point, a study by Koriat et al. 
(1980) noted that a task structure that prom- 
oted reflection on the decision to be made, sig- 
nificantly reduced over-confidence. 

Third, it is to be expected that the motivation 
of subjects would affect how their confidence in 
a decision related to their accuracy. In a number 
of situations, for example, standard setting, the 
motivation of subjects could clearly lead to 
under-confident assessments. However, the lim- 
ited existing research on this topic (Sieber, 
1974) indicates that increased motivation does 
not lead to confidence and accuracy being bet- 
ter matched. 

A fourth factor to be considered is the exper- 
tise of the subjects given the task structure at 
hand. Lichtenstein & Fischhoff (1977) found 
that their student subjects were no better “calib- 
rated” (high calibration existing when confi- 
dence and accuracy are well-matched and vice 
versa) when answering questions in their field of 
study as compared to general knowledge ques- 
tions. This lack of an expertise effect on calibra- 
tion is mirrored in numerous field studies of clin- 
ical physicians (Lusted, 1977; DeSmet et al., 
1979; Christensen-Szalanski &  Bushyhead, 
1981). In contrast, studies of weather forecas- 
ters (Winkler & Murphy, 1968; Murphy & Wink- 
ler, 1971, 1974, 1977) and “experts” who set 
odds for ‘horse racing (Dowie, 1976) show the 
experts to have generally excellent calibration. 
Lichtenstein et al (1982) explain the difference 
between the results for physicians and weather 
forecasters by three factors. First, weather 
forecasters are used to making probabilistic fore- 
casts. Second, forecasting the weather is reason- 
ably repetitive, while diagnosing the ailments of 
patients is highly non-repetitive. Third, the out- 
come feedback for weather forecasters is reason- 
ably prompt and well understood. In contrast, 
physicians often will not know for quite a long 
time after, if their diagnoses have been correct. 
This could be due to patients taking their time to 


339 


report back, going elsewhere or the link be- 
tween ailment and symptom not being well-de- 
fined. These results suggest that feedback and 
task structure are important determinants of an 
expert’s calibration capabilities. 

To summarise, on the various factors which 
might be expected to condition the relationship 
between confidence and accuracy, it is only for 
the “difficulty effect” where the bulk of the re- 
search findings tend to point in an unambiguous 
direction, namely, the more difficult the task the 
more likely is a subject to be over-confident. De- 
spite the ambiguity of the conclusions for the 
other factors described above, it seems clear that 
they must be borne in mind when examining 
issues which deal with the relationship between 
confidence and accuracy. 

In addition to the research findings just de- 
scribed, there is some existing empirical work in 
forecasting which suggests that a confidence 
based consensus measure might be more highly 
correlated with accuracy than the traditional 
consensus measure. Dalkey (1969) used the 
confidence of participants in their responses to 
reach more accurate estimates of quantitative 
phenomena in Delphi exercises. Dalkey found 
that by selecting for inclusion in the feedback 
only those responses considered “highly confi- 
dent” by their proponents, a slightly superior 
result was achieved. Dalkey et al. (1970) found 
that in situations in which the “highly confident” 
group was reasonably large, a definitely superior 
result could be achieved. 

Of course, in reality the relationship between 
accuracy and consensus (however defined) is 
complex because it will depend upon the level 
and range of accuracy achieved by experts, the 
degree of agreement across the decisions and 
the correctness of decision-makers’ confi- 
dences. Thus, how a confidence based consen- 
sus measure compares to the conventional con- 
sensus measure is an empirical issue. 


A new measure of consensus 

The experiment used to compare a confi- 
dence based consensus measure to the conven- 
tional consensus measure is concerned with fai- 
lure prediction. Because of the dichotomous 
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nature of the predictions made (fail, not fail) 
Ashton’s absolute measure of consensus, defined 
as the number of cases a pair of subjects agree 
in their predictions, is used to represent the 
conventional consensus measure. In order to 
facilitate comparison with this conventional 
consensus measure, the form of the new pair- 
wise consensus measure to be used here is de- 
fined as follows: 


_ i) ifa pair of experts, j and j/+1, agree on pre- 
diction /, then the new pairwise consensus mea- 
sure for the pair (J, +1) for prediction # will 
equal their average level of confidence, i.e. 


(confidence,, + confidence,, , ;¥2; 


(ii) if a pair of experts, j and j+ 1, disagree on 
prediction 4, then the new pairwise consensus 
measure for the pair ( j,/+ 1) for prediction 7 will 
equal —1 times their average level of confi- 
dence, Le. 


~1*(confidence,,; + confidence, , ;¥2. 


This form of consensus produces a pairwise mea- 

sure for each prediction which can range from: 
100% (when both subjects are 100% confi- 
dent in their decisions but disagree) to + 100% 

(when both subjects are 100% confident and 

agree). 

_ It might be argued that this new measure of 
consensus has an inherent ambiguity, in that it 
gives the same score of zero to pairs of subjects 
which: ; 


3 
(i) agree on prediction but have zero confi- 
dence; or which 


(ii) disagree on prediction and have zero con- 
fidence. 


.The authors would argue however, that there is 
no ambiguity here because (i) and (ii) are the 
same judgement. This is because an expert 
would only give a zero confidence weighting to 
a decision which s/he believed to be random. 
There are, of course, no substantial differences 
between agreement and disagreement over ran- 
dom decisions. 
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SUBJECTS AND THE STUDY TASK 


The experiment used in this paper was also 
designed to tackle other issues, namely, whether 
small company account users were able to use 
post-1981 Companies Act “abridged accounts” 
successfully and whether they were able to 
utilise additional, bank reference, information 
when making predictions. Thus, since the article 
dealing with these issues has already been pub- 
lished (Keasey & Watson, 1986), the details of 
the experiment will only be briefly described 
here. 

The experiment was concerned with failure/ 
non-failure predictions and was similar to the 
one conducted by Kida (1980), the results of 
which were used by Ashton in her study of the 
relationship between accuracy and consensus. 
In terms of conducting the experiment, arrange- 
ments were made with the organisers of the 
1985 U.K Trade Credit Conference for the au- 
thors to give a paper on risk assessment methods 
based on an evaluation of the delegates’ success 
at handling a number of “risk assessment” case’ 
studies. Details of proposed conference sessions 
were included in a circular sent to all members 
of the Institute of Credit Management during the: 
spring of 1985 and delegates were asked to 
nominate the sessions they would attend at the 
forthcoming conference. Eight-two delegates 
elected to attend the authors’ session and were 
duly sent, during the early summer of 1985, a. 
package of risk assessment case studies and 
asked to reply as soon as possible. By early Sep- 
tember, 36 complete and usable replies and a 
further 22 incomplete/apologies‘excuses had 
been received. Early and late respondents were 
analysed for statistically significant differences 
in responses. None were found and the full set of 
36 subjects was used for the analysis described 
in the following sections._. 

Each trade credit specialist was provided with 
three year financial profiles for 20 (10 failed and 
10 non-failed) real but disguised small 
(£10,000-£100,000 total assets), single plant, 
independently owned mechanical engineering 
companies operating in the north-east of Eng- 
land. The participants were informed that the. 
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companies had the above the characteristics and 
were told that companies of this type had a 50% 
chance of failing, where failure was defined as 
ceased trading within three years of the last set 
of accounts. The 50% probability of failure over 
a three-year period is close to the actual base 
rate for this type of company (see Storey et al, 
1987). 

Each financial profile was based upon the 
post-1981 Companies Act “abridged accounts” 
of the companies. These abridged financial state- 
ments, which are the only accounting informa- 
tion increasingly large numbers of small com- 
panies actually lodge at Companies House, con- 
sist only of a balance sheet and accompanying 
notes. From these statements five financial 
ratios, for each of the three years prior to failure, 
were chosen to reflect, as far as was possible 
given the limitations of these financial state- 
ments, the major categories of financial informa- 
tion and common usage/reference in the 
accounting and finance literatures. 

For each of the 20 companies, the subjects 
were asked to estimate on the basis of the infor- 
mation provided whether the company would 
fail within three years of the last set of accounts. 
Subjects were also asked to indicate their degree 
of confidence for each prediction ona 100 point 
scale. To check whether subjects had used the 
confidence scale in a consistent manner, at the 
end of the exercise they were asked to state how 
confident they were, on a 100 point scale, in 
their overall performance. At this point the sub- 
jects were also asked to state how long the task 
had taken them, their age, their years of experi- 
ence and how they felt about the information 
provided given the nature of the task. 

To determine whether the accuracy, consen- 
sus and confidence of the trade credit specialists’ 
decisions were affected by the cue set provided, 
half of the initial sample of 82 subjects were 
supplied with additional information in the form 
of bank references detailing the bank’s assess- 
ment of the financial condition of the company. 
Bank references were chosen as the additional 
cue after discussions with the Institute of Credit 
Management. These discussions indicated that 
the assessment provided by a company’s bankers 


were one of the most important cues used by 
trade credit specialists. Therefore, the addition 
of bank references seemed the obvious way of 
increasing the non-redundant, experimental cue 
set. 

Of the 36 subjects who completed the study, 
18 had been supplied profiles without the bank 
reference information. There were no signifi- 
cant differences (see Keasey & Watson, 1986) 
between the bank reference group and the no- 
bank reference group in terms of age, experi- 
ence, time taken to complete the task and in 
their comments concerning how useful they 
found the information provided. 

Thus, the major differences of the present ex- 
periment as compared to the one used by Ashton 
are, in terms of the size of companies (small as 
compared to large), the profession (trade credit 
specialists rather than auditors) and nationality 
(being U.K. rather than U.S. based) of the par- 
ticipants, the incorporation of subjects’ confi- 
dence in their predictions to develop the new 
form of consensus and the effect upon accuracy, 
consensus and confidence of increasing the cue 
set provided via the control group method. 


ANALYSIS AND RESULTS 


Due to the dichotomous nature of the predic- 
tions made by the subjects in this study, only 
Ashton’s absolute measures of consensus and ac- 
curacy and the new consensus measure are used 
in the analysis. Both pairwise and individual 
measures of accuracy and consensus will be 
used in the following analysis to allow compari- 
son across the arguments of interest. For as 
argued by Ashton (1985) 


“the traditional measures of consensus and accuracy are 
not comparable. Consensus traditionally is expressed as 
agreement between pairs of subjects while accuracy typ- 
ically is measured for an individual subject. Con- 
sequently, to facilitate comparisons between these two 
dimensions of judgement performance, new measures of 
accuracy and consensus were developed” (p. 174). 


Pairwise absolute accuracy is defined as the 
mean accuracy score over the 20 cases for each 
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pair of subjects. Individual absolute accuracy is 
the number of cases (out of 20) that a subject 
made the correct failure/non-failure prediction. 
The pairwise measure for conventional consen- 
sus is the number of cases each pair of individu- 
als agree in their predictions. The pairwise mea- 
sure for the new confidence based consensus is 
as defined in the previous section. Individual 
consensus, for both measures and for both sub- 
groups of participants (bank and no-bank), is the 
mean consensus over the 17 pairs in which a par- 
ticipant is included. 


Summary statistics and t-tests 
Table 1 presents the summary statistics for the 
three measures of interest for both types of 
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is that the additional bank reference cue de- 
creases consensus. 

Accuracy for the no-bank referercce group also 
shows that the subjects are less accurate than 
those analysed by Ashton — the group mean for 
correct predictions being 66% as compared to 
the 84% mean achieved by the auditors in 


‘Ashton’s study. To test for the “environmental 


predictability” of the financial information pro- 
vided, a stepwise linear discriminant model suc- 
ceeded in correctly classifying 75% (70% and 
80% of failed and non-failed companies, respec- 
tively) of the 20 cases. Whilst this level of predic- 
tive accuracy is low compared to the results ob- 
tained in the majority of large company studies, 
the model’s accuracy is greater than could be ex- 


TABLE 1. Summary statistics and t-tests 








No-bank group Bank group 
Mean SD. Mean SD. t-value 
Pairwise analysis N = 153 (in each group) 
Accuracy 13.33 1.20 13.17 1.28 1.19 
Conventional consensus 15.02 2.67 13.04 3.04 6.02* 
New consensus 30.99 15.51 19.06 18.24 6.17* 
Individual analysis N = 18 (in each group} . 
Accuracy 13.33 1.78 13.17 1.92 0.27 
Conventional consensus 15.02 1.46 13.04 1.60 3.88" 
New consensus 30.99 9.93 19.06 10.09 3.58* 





*Significant at 0.01 level. 


analysis. The summary statistics for the no-bank 
reference group and the bank reference group 
indicate that the provision of the additional cue 
of a bank reference does not significantly affect 
accuracy but significantly reduces both mea- 
sures of consensus. In terms of explaining this’ 
result, the existing evidence from psychology is » 
not particularly helpful. If the additional cue is 
redundant or irrelevant, consensus decreases 
(Einhorn, 1971; Well, 1971; Nystedt, 1974). 
However, if the cue is relevant and uncorrelated 
with prior information, consensus can either de- 
crease (Nystedt & Magnusson, 1972; Nystedt, 
1974) or increase (Huff & Friedman, 1967). As it 
is difficult to say whether bank reference infor- 
mation was perceived as being irrelevant or not 
by the subjects, the only conclusion to be drawn 


pected from random choice. The major reason 
for the low level of predictive accuracy being 
the inherent variability of small firm perform- 
ance (Storey et al., 1987) and the well ‘recog- 
nised unreliability of their financial reporting 
(ASC, 1979, 1980, 1983). 


Correlation analysts 

Table 2 shows the correlation coefficients 
between the various arguments for each sub- 
group. For the no-bank reference group, the 
Pearson (Spearman) correlation between accu- 
racy and the conventional consensus measure, is 
positive and significant at a 1% level for the pair- 
wise analysis and at a 5% (10% ) level for the in- 
dividual analysis. Thus, the positive correlations 
reported by Ashton, whilst also evident in this 
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TABLE 2. Correlation analysis (Pearson/Spearman in parentheses) 








T No-bank group Bank group 
Pairwise analysis N = 153 (in cach group) 
Accuracy 0.47} (0.35)4 0.58} (0.56) 
Conventional consensus 0.96} (0.93)t 0.38t (0.24)t 0.96¢ (0.94)t 0.52f (0.50)ł 
New consensus Accuracy New consensus Accuracy 
Individual analysis N = 18 (in each group) $ 
Accuracy 0.524 (0.41 )t 0.73¢ (0.51 )t 
Conventional consensus 0.96} (0.84) 0.43} (0.33)* 0.96} (0.93)¢  0.70ł (0.38)t 
New consensus Accuracy New consensus Accuracy 


(One-tailed ) significance tests for differences in correlation coefficients for the two consensus measures and accuracy 





Pairwise Z=096 P=0.16 Z=0.78 P=0.22 
Individual Z=030 P=0.38 Z=019 P=0.42 
* <= 10% 
$< 5% 
t< 1% 


study, are somewhat reduced. Also, the positive 
relationship between accuracy and consensus 
increases when the cue set is increased. Ashton’s 
results, therefore, appear to be corroborated by 
the results presented here. 

Turning to the central hypothesis of this 
paper, the results of Table 2, indicate that the 
new measure of consensus is, for both groups 
and for both types of analysis, more highly corre- 
lated with accuracy than the conventional con- 
sensus measure. However, as indicated by the 
“Fisher’s transformation of r” z-statistics shown 
in the table (see Glass & Stanley, 1970), the dif- 
ferences in correlation coefficients are not statis- 
tically significant at conventional levels of confi- 
dence. 

Thus, for the group of experts and task consi- 
dered here, the new measure of consensus does 
not appear to provide a markedly better alterna- 
tive than the conventional consensus measure. 
This does not mean, however, that the new con- 
sensus measure does not provide information 
which might be considered complementary to 
that provided by the conventional consensus 
measure. To test whether the new consensus 
measure provided incremental information as 
compared to conventional consensus, a two 
stage regression approach was adopted. The first 
stage derived uncorrelated residuals from a re- 


gression of the new consensus measure on the 
conventional consensus measure. The second 
Stage regressed the accuracy scores on the un- 
correlated residuals from the first stage. This 
procedure provides semi-partial correlations be- 
tween the new consensus measure and accu- 
racy, which measure the unique contribution of 
the new consensus measure aftef the contribu- 
tion of the conventional consensus measure has 
been “partialled out”. The semi-partial correla- 
tions for the no-bank (sr = 0.37) and bank (sr = 
0.29) pairwise analysis groups are significant at 
1% levels of confidence. For the individual 
analysis, however, only the semi-partial correla- 
tion of 0.39 for the no-bank group was significant 
at 5% levels of confidence (sr for bank group = 
0.22). 

The above analysis indicates that the new 
measure of consensus provides statistically sig- 
nificant additional information in respect of exp- 
laining subjects’ accuracy. This suggests, for the 
present experiment at least, that the two mea- 
sures ought not to be considered as either com- 
petitors or substitutes. Rather, they should be 
viewed as complements since they. jointly suc- 
ceed in explaining a statistically significantly 
greater proportion of the variance in subjects’ 
accuracy than the conventional consensus mea- 
sure alone. 


CONCLUDING REMARKS 


In this paper the role assigned to consensus 
within experimental studies of decision-making 
has been briefly considered and a new measure 
of consensus which incorporates experts’ confi- 
dence in their decisions has been proposed. The 
experimental evidence presented in this paper 
shows that, whilst the new confidence based 
consensus measure is not significantly more 
highly correlated with accuracy than the tradi- 
tional consensus measure, it is complementary 
to the conventional consensus measure. The 
single set of results presented in this paper 
should, of course, not be seen on their own as of- 
fering conclusive evidence for the adoption/ 
non-adoption of the new measure. Instead they 
should be seen as an initial set of results from a 
single set of experts. 

The Deore tye also develops the un- 
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derstanding of the relationship between accu- 
racy and consensus in the following ways. First, 
in a more “difficult” decision setting than those 
considered by Ashton, the relationship between 
accuracy and consensus (however defined) is 
reassuringly significantly positive, if to a some- 
what lesser extent than reported by Ashton. 
Second, although the actual level of agreement 
between individuals and pairs of individuals sig- 
nificantly falls when the available cue set is in- 
creased, the positive correlation between accu- 
racy and consensus appears to be strengthened. 
More work dealing with this relationship and the 
redundancy/non-redundancy of cue changes is 
clearly needed. Finally, the findings suggest that 

~ much more work is needed to understand how 
the new measure of consensus might perform 
across a range of decision situations which con- 
trol for the difficulty of the task and expertise of 
the subjects. 
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Abstract 


This study examined the interactive effect of participation and job difficulty on managerial performance 
and work motivation in a budgetary context. Using middle-level managers from six companies working in 
various functional areas as subjects, this study found that managerial performance was high where 
perceived participation was commensurate with perceived level of job difficulty. On the other hand, 
performance was low where participation was not commensurate with the perceived level of job difficulty. 
No significant interactions were obtained with respect to work motivation. f 


The argument that participation in budgeting 
may be positively related to managerial perform- 
ance and motivation is based on the rationale 
that when a goal or a standard set by a participa- 
tive process is accepted, employees: (a) 
internalize the established goal or standard; and 
(b) feel a personal responsibility to achieve it 
(Milani, 1972). Consequently, employees de- 
velop greater motivation and hence attain better 
performance. Researchers also argue that par- 
ticipation may increase employees’ personal 
feelings of “ownership” of the goal or standard 
and the level of their trust in the intentions of 
management (Vroom, 1964; Lowin, 1968; 
Patchen, 1970; Lawler & Hackman, 1979; Raval, 
1982). Others have argued that participation is a 
mechanism for information exchange by which 
individual employees obtain more information 
about their job (Hopwood, 1976). Such addi- 
tional information facilitates a clearer under- 
standing of their job, thereby improving motiva- 
tion and performance (Mitchell, 1973; Schuler 
& Kim, 1976; Schuler, 1980). 





Nevertheless, there is only equivocal evi- 
dence supporting the posited relationships 
between participation and performance and 
motivation. While a number of studies report a 
significant positive relationship between partici- 
pation and performance in a budgetary context 
(Argyris, 1952; Becker & Green, 1962; Brow- 
nell, 1979; Kenis, 1979), other studies did not 
observe such a relationship (French et al., 1960; 
Fleishman, 1965; Milani, 1975; Ivancevich, 
1976; Steers, 1976). Moreover, some studies 
have reported a negative relationship between 
the two (Bryan & Locke, 1967; Stedry, 1960). 
Similarly, equivocal results have been obtained 
with respect to the posited participation—moti- 
vation relationship in a budgetary context. 
While Vroom (1964) and Hofstede (1967), for 
example, reported a significant positive relation- 
ship between participation in budgeting and 
motivation, Brownell & McInnes (1986) did not 
observe such a relationship (see also Staw, 1977; 
Ivancevich, 1979). 

One possible reason for the equivocal results 


*I am indebted to Peter Brownell, Macquarie University, Ken Ferris, Southern Methodist University and Ron Weber and Peter 
Ziegler, Queensland University for their comments on an earlier version of the paper. Comments by participants in seminars 
on the paper at University of Otago, University of New South Wales and at Canterbury University are apprectated. I am also 
indebted to the anonymous reviewers for their useful comments and suggestions on earlier drafts of the paper. 
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reported in previous studies is that the relation- 
ships may be moderated by contingency (situa- 
tional) variables (see Brownell, 1981). Hop- 
wood (1976), for example, suggests that the 
moderating role of contingency variables, such 
as job characteristics, personality characteristics 
and social conditions, be considered while 


studying participation in budgeting. A review of . 


the literature reveals that a number of studies 
have investigated the relationship between 
budgetary participation and employee 
behaviour while considering the contingent role 
of personality variables. Brownell (1981), for 
example, reported that locus of control (a per- 
sonality variable) moderated the relationship 
between job: satisfaction, performance and 
budgetary participation. Previously, Vroom 
(1959) and Hofstede (1967) reported that per- 
sonality variables such as authoritarianism and 
reed for independence moderated the relation- 
ship between motivation, performance and par- 
ticipation in decision-making. 

No studies, however, have addressed the 
possible moderating role of a job characteristic 
such as job difficulty on the relationship be- 
tween budgetary participation and employee 
behaviour, particularly at the individual level 
(ie. using individual employees as the ‘unit of 
analysis ). This study, conducted in a field setting, 
investigates such a moderating role by examin- 
ing how job difficulty interacts with participa- 
tion to influence managerial performance and 
motivation. ` 

The remainder ofthis section discusses the 
moderating effects of job difficulty and develops 
the hypotheses to be tested. Subsequent sec- 
tions focus on the method of the study, the 
results and a discussion of the results, limitations 
and conclusions. 


JOB DIFFICULTY AS A MODERATING 
VARIABLE 


Two important characteristics of a job or task 
(henceforth, job) are its difficulty and variability 
(Van de Ven & Delbecq, 1974). A job may be 
difficult because of its 


complexity, - 
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heterogeneity, unpredictability, lack of a pre- 
scribed performance procedure or because ofits 
changing ‘operational technology (Perrow, 
1967). A job may also be difficult because of its 
variability. Indeed, Van de Ven & Delbecq 
(1974) argue that when a job becomes less dif- 
ficult as a result of learning, it is likely to become 
less variable because a body of knowledge may 
have been developed specifying a prescribed 
way to perform it. Routineness is another im- 
portant characteristic ofa job. A routine job may 
be programmed by the prescription of rules and 
procedures which facilitate performance (Ait- 
ken, 1960; Fullan, 1970) and, as a consequence, 
the job becomes less difficult. Thus, difficulty is 
an important job characteristic and it may be a 
surrogate for other important job characteristics 
such as variability and routineness. 

A potential moderating effect of job difficulty 
on the participation—performance and participa- 
tion—motivation relationships appears to be sup- 
ported by the extant literature. In a high job dif- 
ficulty situation, more information is required at 
the point of task. execution (Fry & Slocum, 
1984). Managers operating in such a situation 
may not have all the necessary information to 
perform the job; they may need to obtain and 
process additional information to understand 
the job clearly, thereby improving their chances 
of successful performance and obtaining desired 
rewards. 

Participation as a mechanism for information 
exchange may enable managers to obtain and 
process- additional information about their job 
(Hopwood, 1976; Galbraith, 1977). Such addi- 
tional information may facilitate understanding | 
of their job requirements, thereby increasing 
their confidence (expectancy ) that efforts made 
to perform the job may be successful. Under 
expectancy theory of motivation (see Vroom, 
1959; House, 1971; Lawler & Suttle, 1977; 
Ferris, 1977, 1978), an increase in the expec- 
tancy that effort leads to successful performance 
(E->P) may increase managers’ motivation to 
perform their job. Thus, it can be argued that 
participation in budgeting may improve mana- 
gers’ motivation and performance particularly in 
a high job difficulty situation. Hopwood (1976, 
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pp. 83-85), for example, argues that in the case 
of a high job difficulty situation, participation in 
budgeting may offer greater benefits, than a non- 
participative style. 

In a low job difficulty situation, on the other 
hand, it is possible to develop and apply pre- 
determined rules, policies and standards for the 
performance of a job (Tushman & Nadler, 
1978). Thus, individual employees operating 
under these conditions may be able to perform 
the job only by knowing the prescribed rules, 
policies and standards; they may not need to 
obtain and process additional information for 
this purpose and, therefore, have less need for 
participation. If work is routine (less difficult), 
most decisions involving the task are likely to be 
routine hence, managers will be more motivated 
and perform better if their time is not wasted by 
participation in decisions with obvious solutions 
(see Strauss, 1982). 

Based on the discussions above, it can be 
argued that the relationships between participa- 
tion and performance and between participation 
and motivation are contingent upon an appro- 
priate fit between participation and job diffi- 
culty. A highClow) level of participation under a 
high(low) job difficulty situation (the appro- 
priate fit) is likely to have a more positive 
(strong) relationship with managerial perform- 
ance and motivation. The converse is expected 
when the appropriate fit does not occur (see Fig. 
1). The hypotheses stating this proposition are 
as follows: 


H. The higher the perceived job difficulty (X,), the more 
positive is the relationship between participation in 
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budgeting (X,) and managerial performance (Y). In 
mathematical terms, dY/dX, is greater when X, increases. 


H The higher the perceived job difficulty (X,), the more 
positive is the relationship between participation in 
budgeting (X,) and motivation (Af). In mathematical 
terms, dM/dX, is greater when X, increases. 


The hypotheses are summarised in Fig. 1. 


METHODS 


Research stte and sample 

Data for the study were collected using a sur- 
vey questionnaire from middle-level managers 
working in six companies operating in New Zea- _ 
land. The number of people employed by each of 
these companies was between 100 and 1500. Of 
these six companies, one manufactures food 
items, one makes furniture and wood products, 
one assembles automobiles, two provide retail 
services and one provides banking services. 

Discussions in preliminary meetings with the 
managements of these companies indicated that, 
for several years, these companies have been: (a) 


‘using comprehensive budgeting systems; and 


(b) interested in encouraging a participative 
style of budgeting in all departments/sections. 
Moreover, it was also evident from the discus-' 
sions that: (i) employees needed time to under- 
stand and follow various policies of the com- 
pany; and (ii) an employee holding at least a 
middle-level management position had 
adequate opportunities to participate in setting 
and implementing his or her (henceforth, his) 





Fig. 1. The model of the study. 
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departmental/sectional budgets and goals. The 
two criteria for selecting participants were: 


(1) the individual had been working for the 
company for at least six months prior to the 
‘study; and 

(2) the individual was holding a management 
position and was responsible for preparing and/ 
or implementing the budget or target for his 
department/section. 


The managements of the six companies pro- 
vided 93 participants from among the managers 
who were available at the time of the study and 
who met the two selection criteria.' 

Each participant was sent a questionnaire 
together with a letter explaining the objective of 
the study and assuring anonymity of his 
responses. Each participant was provided with a 
self-addressed prepaid envelope to return the 
questionnaire directly to the researcher. 

Of the 93 managers selected for the study, 76 
fully completed and returned the questionnaire 
yielding a response rate of 81.7%. These mana- 
gers were from production (20% ), sales (16% ), 
marketing (6% ), accounting (18% ), import and 
purchasing (13% ), financing (10% ), personnel 
(10% ) and MIS (7% ). On average, the respon- 
dents had been employed for five and a half years 
by their respective companies and supervised, 
on average, 13 subordinates. 


Measurement of variables 

The questionnaire obtained measures for 
three variables: participation in budgeting, job 
difficulty and work motivation. 


Participation in budgeting Participation 
refers to the amount of influence and involve- 
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ment that an individual employee perceives he 
or she has on a jointly-set budget (Milani, 1975; 
Brownell, 1979). Milani’s participation con- 
tinuum scale was used to assess the employee’s 
perceived amount of participation in budgeting. 
Since Milani’s measure has been widely used, the 


‘results of this study can be integrated into the 


existing body of literature (Brownell, 1982). 

Examples of items included in Milani’s scale are: 

“Which category below best describes your 

activity when the budget is being set? I am 

involved in setting:” and “How much influence 

do you feel you have on the final budget?”. The 

managers rated their level of perceived partici- 

pation in budgeting under each of the 6 items on 

a 7-point Likert-type scale anchored at both 

ends. A managet’s overall score for participation 

was the sum of scores for the 6 items. A check of 
internal reliability yielded a Cronbach (1951): 
alpha coefficient of 0.91 for this scale. Moreover, 

a factor analysis of the scores yielded one eigen- 

value greater than unity, hence, one factor was 

extracted suggesting that the scale was uni- 

dimensional. The factor accounted for 69.9% of 
the variance. Table 1 presents the factor load- 

ings, eigenvalue and the percentage of explained 

variance. 


Job difficulty. This variable refers to the 
degree of complexity of the search process in 
performing a job, the amount of thinking time re- 
quired to solve job-related problems and the 
body of knowledge that provides guidelines for 
performing the job (Perrow, 1967). Job diffi- 
culty was measured using a 7-item Likert-type 
scale developed by Van de Ven & Delbecq 
(1974). Examples of items included in this 
measure are: “To what extent is there an under- 


'The company managements agreed to select the participants for the study using the two selection criteria only. This was 
achieved by making the managements fully aware of the rationale for the selection criteria and the importance of using those 
criteria in selecting the participants. It is believed that participants having different personality characteristics had a fair 
chance to be included in the sample. Thus, the research design is expected to be randomized for personality characteristics. 
One may also argue that since the participants had been working for their company for a fairly long time (five and a half years 
on the average), they might have become used to the particular style of management in the company. Consequently, the ' 
possible influence of their individual personality characteristics on the effectiveness of their participation in budgeting might 
have become neutral. Since data for this study were collected from companies operating in New Zealand only, the results may 


be specific to New Zealand culture. 
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TABLE 1. Factor analysis of participation and job difficulty 











responses 
Participation Job difficulty 
Loadings Loadings 
Items factor 1 Items factor 1 
1 0.86 1 0.59 
2 0.73 2 0.55 
3 0.71 3 0.61 
4 0.90 4 0.63 
5 0.89 5 0.72 
6 0.94 6 0.65 
7 0.70 
Eigenvalue 4.19 Eigenvalue 2.85 
Variance Variance 
explained 70% explained 41% 





standable sequence of steps that can be followed 
in doing your work?” and “On average, how 
much actual thinking time do you usually spend 
trying to solve such specific difficult problems?”. 
Managers rated their perceived level of job diffi- 
culty on a 7-point scale anchored at both ends. A 
manager’s overall score for job difficulty was the 
sum of his or her scores for the 7 items. A check 
of internal reliability yielded a Cronbach (1951) 
alpha coefficient of 0.75 for this scale. Moreover, 
a factor analysis of the scores yielded one éigen- 
value greater than unity, hence, one factor was 
extracted suggesting that the measure was uni- 
dimensional. The factor accounted for 41% of 
the variance. Table 1 presents the factor load- 
ings, the eigenvalue, and the percentage of 
explained variance. 


Work motivation. An employee’s work moti- 
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vation refers to a joint function of his or her 
expectancy that effort (F) towards a job per- 
formance (P) will lead to the successful per- 
formance of the job (i.e. E->P), his or her expec- 
tancy that successful performance of the job will 
lead to attainment of some outcomes (ie. P->O) 
and his or her valence (V) or desirability of those 
outcomes (Ferris, 1977). The construct was 
measured using the scale developed by Lawler & 
Suttle (1973). Part one of the questionnaire con- 
sisted of 20 expectancy items: 2 items measured 
(E->P) expectancies and 18 items measured (P- 
>O) expectancies. For (E->P) expectancies, ef- 
fort was represented by the expression “working 
hard”, for (P->O) expectancies, performance 
was represented by expressions such as “high 
productivity” and outcome by expressions such 
as “promotion”. Each of the expectancy items 
called for a response on a 7-point Likert-type 
scale ranging from never (1) to always (7). Part 
two of the questionnaire consisted of 20 items 
measuring the valence (desirability) of the 20 
task outcomes. Respondents rated their valence 
on a 9-point Likert-type scale, ranging from ex- 
tremely undesirable (1) to extremely desirable 
(9). The 20 outcomes included both intrinsic 
and extrinsic rewards. The overall score for a 
manager’s motivation was determined by multi- 
plicatively combining his or her scores under 
the expectancy and valence items.” The Lawler 
& Suttle’s instrument has been shown to be reli- 
able, and it has been widely used (Ferris, 1977). 
An internal reliability check yielded Cronbach 
(1951) alpha coefficients of 0.71 for the mea- 
sure of (E->P) expectancy, 0.82 for (P->O) ex- 
pectancy and 0.72 for valence (V); these results 


Work motivation: M=(B->P)2[(P->OXV)] where (E->p) = effort leads to performance expentancy, (P->O) = 
performance leads to outcome expectancy and (v) = valence or desirability of outcome or reward. This study used a 
between-subjects analysis of motivation scores. An alternative to this method would be a within-subjects analysis (Johnson, 
1979). However, following Ferris (1977, 1979) and Jiambalvo (1979), a between-subjects analysis method was used here for 
two reasons. First, this method is comparatively easy to use and understand. Jiambalvo (1979, p. 448), for example, argues 
that “Research using a within-individuals analysis becomes cumbersome very quickly”. Second, empirical evidence shows 
that a within-subjects analysis is not superior to a between-subjects analysis. Examples of studies supporting this argument 
are Parker & Dyer (1976), Kopelman (1979) and Murray & Frazier (1986) (see also Ferris, 1979). It is expected that 
individual employees have a desire to obtain both intrinsic and extrinsic rewards (intrinsic and extrinsic valences) for, say, 
good job performance. Consequently, treating the two valences separately in constructing motivation scores would result 
in an incomplete explanation for an individual's motivation. Moreover, the relationship, if any, between employees’ 
participation and their extrinsic valences is not theoretically clear (see Ronen & Livingstone, 1975; Brownell & Mcinnes, 
1986), 


352 


compare favourably with previous results? (Fer- 
ris, 1977). 


Managerial performance. Performance in 
this context refers to the degree of successful 
role achievement attained by an employee 
(Ferris, 1977). Although previous studies. of 
budget participation have used ‘self-ratings 
(Brownell, 1979, 1982; Merchant, 1981), the 
current study used superiors’ ratings of perform- 
ance for two reasons. First, self-ratings are likely 
to have a higher leniency error (higher mean val- 
ues) and a lower variability error (restricted 
range) in the score (see Prien & Liske, 1962; 
Thornton, 1968). More specifically, the likeli- 
hood that a manager will rate his or her own un- 
satisfactory (below average ) performance as un- 
satisfactory is at least low if not nil. Second, the 
superiors’ ratings are likely to provide a wider 
‘(more global ) evaluation of managerial perform- 
. ance (Heneman, 1974). - 

~ The list of participants provided by manage- 
ment of the six companies included the name 
and address of each participating manager’s 
superior and the period for which the particip- 
ant had been working under his or her current 
superior. At least a period of three months was 
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requested to accurately evaluate the overall per- 
formance of the manager reporting directly to 
him or her using a 9-point Likert-type scale 
ranging from (1) well below average to (9) well 
above average performance. 

Each of the respondent superiors was assured 
of anonymity and informed that the information 


collected would be used only for the purpose of 


the study.’ Superiors’ ratings for 62 managers 
were obtained, hence the final sample size was 
62 for the analysis of the performance data and 
76 for the motivation data. 


RESULTS 


Table 2 presents descriptive statistics and 
Table 3 the correlation matrix for the variables 
utilized in this study. As there is no significant 
multicollinearity amongst the variables as 
shown in Table 3 (see Nie et al., 1975, p. 340), 
regression analysis was used to test hypotheses 
H, and H, (Blalock, 1972; Schoonhoven, 1981). 
Moreover, the multiplier interaction model pre- 
sented in equation (1) is efficient in testing the. 


TABLE 2. Descriptive statistics for the variables 
































considered necessary for a superior to form an Actual Theoretical 
accurate opinion about his or her subordinate’s Variables, Means SD. range range 
performance. Five of the 76 managers (who fully Performance 5.15 2.82 2-9 1-9 
completed and returned the questionnaire) did Motivation 39.06 16.90 0-90 na 
not meet the above criterion. The superior of Participation 29.12 8.71 9-40 6-42 
each of the remaining 71 managers was Jobdifficulty 21.26 497 1033 7-35 
TABLE 3. Correlation matrix for the variables 

Y M Xı X, XXa 
Y Performance 1.00 
M Motivation 0.10 1.00 
X, Participation 0.18 0.16 1.00 
X: Job difficulty —0.21" 0.01 0.02 1.00 
XX Interaction 0.314 0.03 —0.08 —0.23° 1.00 
* p<0.05 l 
+ p<0.01 





Since the measure of motivation involves a multiplicative model (see note 2), a factor analysis of the responses would not 


be useful. 


“The superiors were also requested to evaluate performance of a participating manager only if they (superiors) could do it 
without any personal prejudice. Superiors of six participating managers refused to evaluate their (manager's) performance 
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hypothesized interaction effect (Khandwalla, 
1974; Allison, 1977; Brownell, 1981, 1982; 
Schoonhoven, 1981; Govindarajan, 1986; 
Chenhall, 1986). 


Hypothesis H, was tested using the regression 
model: 


Y=a+ bX, + bX, + bX; X: +e. (1) 
where Y is managerial performance, X,, partici- 
pation in budgeting, X, job difficulty and XX; is 
interaction. 


To permit an acceptance of hypothesis H,, the 
coefficient b, in equation (1) is required to be: 
(a) significant, indicating a significant interac- 
tion between participation and job difficulty 
affecting performance; and (b) positive to sup- 
port the direction of the hypothesis. 

The results presented in Table 4 reveal that 
the coefficient b, was positive and significant (p 
< 0.04). The prediction model in equation (1) 
explained 13.83% of the variance (R° = 13.83% 
(adjusted R? = 9.4% ), F = 3.10, p < 0.03) in per- 
formance. Therefore, the results indicate that 
budgetary participation and job difficulty 
interacted at a significant level to positively im- 
pact managerial performance. Thus, H, was ac- 
cepted. 


Figure 2 presents a graphical representation of 
the results to enable a clearer understanding of 
the significant interaction between budgetary 
participation and job difficulty in relation to 
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managerial performance. Drawing the figure 
involved taking the partial derivative of equation 
(1) over X, and then plotting the joint effect of 
the main and interaction terms (see Southwood, 
1978; Schoonhoven, 1981; Govindarajan, 
1986). Figure 2 indicates that the higher the job 
~ difficulty, the greater the positive impact of par- 
ticipation on performance (see Schoonhoven, 
1981, p. 364). 
Hypothesis H, was tested using a model iden- 
tical to equation (1) by substituting the motiva- 


Y = Performance 
X1 = Participation 


1.0 





Job Difficuity{X2} 


whera = by + b3XQ 


from Y = a+ bjX} + b2X2 + b3X]X2 + @ 


Fig. 2. Significant interaction between participation and job 
difficulty affecting performance. 


TABLE 4. Test of hypothesis H,. 
The interaction between participation and job difficulty affecting performance 


—=—--Variables --- -= ----— 
X, Participation 
X: Job difficulty 
X,X2 Interaction 
Constant 





Level of 
— Coefficient —- T*value  -. significance(p) 
0.24 1.22 ng 
—0.39 1.16 ns 
0.69 2.05 <0.04 
6.41 39.27 <0.001 





R* = 13.83% , (3,58) = 3.10, p<0.03. 
n= 62. ‘ 











*One-tailed test. 
tAdjusted R? = 9.4%. 
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tion (M) data for performance (P) data. The 
results were not significant and, therefore, hypo- 
thesis H, was rejected.’ An implication of the 
results is discussed later in the paper. 


DISCUSSION, LIMITATIONS AND 
CONCLUSIONS 


As expected, the proposed interaction 
between participation and job difficulty affect- 
ing performance was significant. Performance 
was found to be high when the amount of partici- 
pation was proportionate to the level of job diffi- 
culty. In contrast, performance was found to be 
low when the amount of participation was dis- 
proportionate to the level of job difficulty. Thus, 
a congruence (good fit) between the level of 
participation and job difficulty appeared to be a 
pre-condition for effective participation (in 
. terms of performance). 

These results support the argument that 
employees may find participation useful when 
they perceive their job difficulty to be high and 
when their perceived level of job difficulty is 
low, they may find participation relatively 
ineffective. This study, therefore, offers one exp- 
Janation for the equivocal results of previous 
studies which reported a negative or no relation- 
ship between participation and performance 
(Blumenfield & Leidy, 1969; Milani, 1975; Bryan 
& Locke, 1976; Kenis, 1979; Steers, 1979). The 
level of participation in budgeting by respon- 
dents in those studies might have been dispro- 
portionate to the level of perceived job diffi- 
culty. Similarly, this study offers one explanation 
for the results of previous studies which re- 
ported a positive relationship between partici- 
pation and performance (Schuler & Kim, 1979; 
Schuler, 1980; Merchant, 1981; Brownell, 
1982), the level of participation in budgeting by 
respondents in those studies might have been 
proportionate to the level of job difficulty they 
perceived. , 

The results for work motivation are dis- 
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appointing but not surprising. Research results 
for motivation have been equivocal. While some 
studies (e.g. Patchen, 1970; Siegal and Ruh, 
1973; Collins, 1978; Kenis, 1979) have reported 
a significant positive relationship between par- 
ticipation and motivation, other studies (e.g. 
Staw, 1977; Ivancevich, 1979; Brownell, 1983) 
observe no such relationship. More recently, 
Brownell & McInnes (1986) could not obtain a 
significant positive relationship between partici- 
pation and motivation. One possible explanation 
for such equivocal results is the apparent lack of 
a consistent approach to measuring motivation 
in previous studies investigating the participa- 
tion—motivation relationship (see Brownell & 
McInnes, 1986). The approach to defining and 
measuring motivation in the current study is 
consistent with that taken by Brownell & McIn- 
nes (1986). Both these studies used an expec- 
tancy model for defining, and the scale 
developed by Lawler & Suttle (1973) for 
measuring motivation. The insignificant results 
for motivation obtained in the current study are 
therefore consistent with those reported by 
Brownell & McInnes (1986). Furthermore, 
based on the results of this study, it can be 
argued that perceived job difficulty does not act 
as a contingent variable to moderate the partici- 
pation—motivation relationship. Thus, the 
results of the study attempt to improve our un- 
derstanding of the lack ofa relationship between 
participation and motivation. 

With respect to the limitations of the present 
study, several points are worth: mentioning. 
First, the managements of the six companies 
selected the participants for the study and, 
hence, the sample was not strictly random. Con- 
sequently, some caution is necessary before 
generalizing the results. Second, the regression 
model of performance (equation (1)) explained 
a relatively low level of variance (R? = 13.8% ) 
indicating that variables other than budgetary 
participation and job difficulty are at work affect- 
ing managerial performance. Future research 
needs to pursue this issue further. 


Hypothesis H, was tested using an alternative formulation of motivation, Le. M=2[(B->O \ V)] as suggested by Ferris (1977, 
p. 606). The results were not significant and hence, rejection of hypothesis H, was further confirmed. 
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Notwithstanding such limitations, the results 
of this study might help to reconcile the results 
reported by previous studies of participation in 
budgeting. The study therefore makes some con- 
tribution to improving our understanding of the 
effect of budgetary context. For practising mana- 
gers, the results suggest that participation in 
budgeting as a tool for improving managerial 
performance should be used selectively. In cases 
where perceived job difficulty is high, encourag- 


ing managers to participate in budgeting is likely 
to improve their performance. On the other 
hand, in cases where perceived job difficulty is 
low, encouraging managers to participate in 
budgeting is unlikely to improve their perform- 
ance. In such cases; prescribed rules, pro- 
grammes, policies and standards may be more. 
effective in the pursuit of improved perform- 
ance. 
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Abstract 


Within the academic and professional auditing communities there has been growing concern with 
accurately assessing the various risks associated with the performance of an audit. One approach to 
developing sophisticated risk assessment models is to study how experienced expert auditors use industry 
and firm specific factors in making audit judgments. This study presents a model of inherent risk assessment 
based on literature reviews and a field study that involved structured and unstructured interviews and 
observations of experts in audit planning meetings. Analysis of the data gathered led to the specification of 
a conceptual model of inherent risk assessment which has been implemented as a computer program (a 
computational model). Auditors were asked to assess the behavior and performance of the computational 


model as a first step in evaluating the expert model. 


Over the last five years there has been a growing 
interest in using computer based cognitive mod- 
els to study the audit process. The state of the art 
of these research efforts has been reviewed by 
` Messier & Hansen (1986). This paper describes 
progress to date on a research project to develop 
conceptual and computational (computer pro- 
gram) models of inherent risk assessment in 
audit planning. The project is based on an analy- 
sis of the various phases involved in a complete 
audit process as opposed to isolated audit tasks. 


This global perspective makes it more likely that. 


specific models developed as a part of this re- 





search will integrate readily into a comprehen- 
sive model of the entire audit process. 
Operationalizing the conceptual model of in- 
herent risk in terms of a computational model 
has several advantages for studying and model- 
ing audit decision-making. First, the computer 
program is a precise and testable expression of a 
human information processing model. Another 
advantage is that computer programs also allow 
for comprehensive testing of a model under dif- 
ferent conditions of the task environment in that 
they can be run with data sets that vary in con- 
trolled ways. In addition, the program can be 


*Funding for this project was provided by the Peat Marwick Foundation through their Research Opportunities in Auditing 
Program (Project Number 84-118) and by a Faculty Research Grant of the Graduate School of Business of the University of 
Pittsburgh. We are especially grateful for the many hours contributed by the professional accountants who formed the basis 
for our conceptual model of risk analysis. We have benefited from the useful comments of Terry O'Keefe and two anonymous 


referees. 
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built to provide traces of its own behavior. In 

this way, patterns of behavior can be studied and 

compared to human behavior under controlled 

conditions. A pragmatic benefit of developing a 

computer model is the possibility of using such 

a model to support human judgment. If the pro- 

gram incorporates the expertise of experienced 

auditors and has access to an accumulating store: 
of client data, it can be used to make audit judg- 

ments for validating or augmenting judgments of 
auditors on specific cases. 

This research project has resulted in the de- 
velopment of conceptual and computational 
models of inherent risk assessment in auditing. 
Inherent risk assessment was identified as an ap- 
propriate starting point in the overall audit pro- 
cess after a series of loosely structured inter- 
views with auditors and a review of professional 
and academic literature. Analysis of the data 
gathered in these initial efforts led us to con- 
clude that audit planning constitutes an ex-ante 
rehearsal of the entire audit. process. Within 
planning, inherent risk assessment was iden- 
tified as the central activity involving judgment 
that forms the basis for audit planning. - 

The remainder of this paper is divided into 
five sections. First, we discuss the concept of in- 
herent risk assessment in auditing by reviewing 
the extant literature. In the third section we dis- 
cuss the observation techniques we have used in 

_attempting to understand the audit process and, 
- more specifically, inherent risk evaluation. This 
is followed by a description of the conceptual 
model of inherent risk assessment. The fifth sec- 
tion presénts the computational model that em- 
bodies the basic features of the conceptual 
model. The sixth section discusses auditors’ 
reactions to the initial model. We conclude with 
a summary of the current state of our project and 
- future research that we intend to conduct as part 
of this project. l i 


INHERENT RISK ASSESSMENT 
An important first step to developing a model 


of inherent risk assessment is to determine the 
requirements it places on the auditor.-We began 
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by reviewing the extant literature to determine 
what these task requirements were. The litera- 
ture can be classified into two main groups: pre- 
scriptive (i.e. how auditors should perform inhe- 
rent risk assessment) and descriptive (i.e. how 


- auditors actually perform inherent risk assess- 


ment). The prescriptive literature includes pro- 
nouncements of the AICPA and audit firm manu- 
als. The descriptive literature includes several 
academic studies. 


Prescriptive literature 


Inherent risk assessment is part of a larger 
problem of audit risk assessment. Both these 
concepts have been discussed widely in the pro- 
fessional and academic literature. Cushing & 
Loebbecke (1983) provide a detailed discussion 
of the historical development of the risk model, 
a review of the literature dealing with the ap- 
propriateness of the model and a discussion of 
the problems that-auditors might encounter in 
trying to implement it. The majority of this liter- 
ature has focused on the AICPA’s audit risk 
model as presented in SAS 47: 


‘ AR = IR* CR * DR, 
where AR is audit risk, IR is inherent risk, CR is 
control risk and DR is detection risk. 


Inherent risk is the susceptibility of an ac- 
count balance or class of account balances to 
error that could be material assuming that there 
are no related internal accounting controls. Con- 
trol risk is the risk that this error in the account 
balance will not be caught by the client’s inter- 
nal control system. Detection risk is the risk that 
any error not detected by the control system 
will not be detected by the audit procedures, 
thereby affecting financial statements. The 
AICPA emphasizes that the above model of audit 
risk is a purely conceptual one, and says little 
about whether or how its various components 
can be measured. ; 

According to the AICPA, the risk assessment 
process should occur during audit planning. Au- 
ditors should determine an acceptable level for 
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audit risk, assess the levels of inherent risk and 
control risk and then determine the level of de- 
tection risk based on these assessments. Au- 
ditors should assess the inherent riskiness of 
specific accounts by reviewing a variety of fac- 
tors that are specific to the client, the client’s in- 
dustry or the economy in general and by deter- 
mining the impact of these factors on individual 
accounts. The purpose of this assessment is to 
help determine the nature, timing and extent of 
tests of the client’s internal control systems and 
transactions (AICPA, 1985). 

The audit approaches of several Big Eight CPA 
firms call for the assessment of inherent risk at 
two levels during the audit planning process: the 
engagement level and the account level (Arthur 
Andersen and Co., 1983; Elliott, 1983; Grobstein 
& Craig, 1984; Peat Marwick International, 
1985). At the engagement level, inherent risk as- 
sessment is used to consider whether the client 
is too risky to retain and to provide some per- 
spective on acceptable levels of individual ac- 
count risk. At the account level the main con- 
cern is-assessing the degree of risk associated 
with a given account balance in the general 
ledger. The primary emphasis of these ap- 
proaches is on assessing risk at the individual ac- 
count level. 

The professional literature also provides lists 
of factors that should be considered in assessing 
inherent risk. Peat Marwick International’s 
(1985) audit manual, for example, presents a 
representative list: 

(1) monetary amount associated with the ac- 
count; te ao 

(2) susceptibility of asset to theft; 

(3) complexity required to determine 
amounts to be entered in the account; 

(4) degree of management judgment involved 
in valuing the account; l 

(5) degree to which external events affect val- 
ues in the account; 

(6) past history of error; 

(7) degree to which clients financial condi- 
tion motivates management to misstate the 
amount in the account; and 

(8) experience of the personnel performing 
accounting functions involving the account. 


‘Descriptive approaches 


The descriptive approaches to inherent risk 
assessment in the academic literature have at- 
tempted to examine the cognitive ability of au- 
ditors to implement the risk model and to deter- 
mined what factors auditors believe to be im- 
portant in the assessment of inherent risk. 


Cognitive abilities. Boritz et al (1986) note 
that the risk evaluation task as characterized by 
the risk model can be represented as a problem 
of decision-making under risk. Their benchmark 
for cognitive ability in such a task is the subjec- 
tive expected utility (SEU) model. Based on an 
extensive discussion of the audit judgment liter- 
ature, they conclude that there is substantial evi- 
dence that auditors do not conform to the SEU 
model when performing these judgments. 

The result of two other studies provide con- 
flicting opinions of the auditor’s ability to make 
the kinds of judgments required by the audit risk 
model. Jiambalvo & Waller (1984) concluded 
that *... auditor’s intuitive combination of the 
risk componets did not correspond closely with 
the kind of combination dictated by the audit 
risk model of SAS 39”. However, Libby et.al 
(1985) concluded that “[i]t is refreshing that au- 
ditor subjects’ performance so closely matched 
predictions from the audit risk model.” A major 
limitation of audit judgment research is that the 
tasks that auditors are asked to perform often 
bear little relationship to the tasks that they per- 
form in actual practice. Since judgments may be 
sensitive to even small changes in the task envi- 
ronment (Einhorn & Hogarth, 1981), the use of 
unrealistic judgment tasks could explain many 
of the divergent findings in the literature. 


Factors that affect inberent risk. Boritz et al 
(1986) gave their subjects four case studies for 
which ten audit planning judgments had been 
made and asked the subjects to review and 
evaluate these judgments, to provide a revised 
set of judgments based on their evaluation and to 
rate the difficulty of each judgment. One of the 
ten judgments was an assessment of the inherent 
riskiness of the client. They found that inherent 
risk assessments were sensitive to liquidity prob- 
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lems, inventory levels, strength of internal con- 
trols, history of errors and fraud and whether the 
audit is connected with an initial public offering. 
They concluded that inherent risk judgments 
have a broader interpretation than that implied 
by the definitions provided in professional 
standards. They indicated that one reason for 


this broader interpretation by their subjects- 


might be that the subjects were asked to assess 
the inherent riskiness of the audit whereas such 
assessments are usually based on individual ac- 
counts. 

Gibbins & Wolf (1982) used an extensive 
questionnaire to elicit from auditors sets of 
items that were the best predictors of whether 
significant audit problems would be encoun- 
tered for a hypothetical audit. The auditors were 
also asked to list other items that were highly re- 
lated to the predictive items. The auditors were 
given a set of items from which to choose but 
were also allowed to add items of their own. The 
Gibbins & Wolf study dealt with five stages of the 
audit process of which planning was one. The re- 
sponses that relate to the planning stage are most 
relevant to this research. The items that were 
identified most often can be classified as either 
client factors or audit firm factors. Client factors 
include the quality of the accounting system and 
internal controls, the competence of top man- 
agement, the accounting staff and the internal 
audit staff, plans for major financing activities, 
financial condition and profitability and the 
client’s direct interest in the conduct of the 
audit. Audit firm considerations include avail- 
ability and continuity of staff resources, fee con- 
siderations, audit approach and the competence 
of the audit manager and field staff. 

Although the Gibbins & Wolf task was not to 
specifically assess inherent risk but to identify 
factors that could contribute to potential prob- 
lems in the audit, the two task are similar enough 
so that their results can be used to provide some 
insight into the inherent risk assessment pro- 
cess. ; 


Summary of risk model implications 
The conclusions most relevant to this re- 


search that can be drawn from the audit risk lit- © 
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erature involve the identification of factors that. 
affect inherent risk and the professional guid- 
ance given to auditors for assessing inherent risk. 
These conclusions can be summarized as fol- 
lows. 

(1) A review of the factors that affect inherent 
risk as. presented in audit manuals and as deter- 
mined in the descriptive studies discussed earl- 
ier indicates that the factors considered by au- 
ditors are broad and include not only client fac- 
tors but also audit firm factors. Some of the fac- 
tors considered are easily quantifiable (e.g. 
client’s liquidity position) while others are not 
(e.g. quality of client management or client 
wishes with respect to the conduct of the audit). 

(2) A review of the factor lists that were de- 
veloped in the descriptive studies indicates that 
inherent risk is not independent of control risk 
in that auditors consider some general assess- 
ment of control risk when assessing inherent 
risk. 

(3) The results of descriptive studies also 
imply auditors use some method for selecting, 
evaluating and combining the diverse sets of fac- 
tors mentioned above in coming up with judg- 
ments on inherent risk. What is not clear, how- 
ever, is the relationship among these factors and 
how these relationships are used intuitively by 
experienced auditors in assessing inherent risk. 


RESEARCH METHOD 


The long range purpose of this research effort 
is to develop a theory of how auditors assess in- 
herent risk in a normal audit environment, to 
test the theory by developing a computational 
model that reflects the theory and to compare 
the computational model’s behavior to that of 
experienced auditors. 

Research projects that use cognitive compu- 
ter models to test theories of human judgment 
normally proceed in several distinct phases. 
First, the judgment task and its environment are 
investigated by reviewing relevant literature and 
by interviewing experts in the judgment task. 
Second, more sharply focused data are gathered 
to refine the researcher’s understanding of the 
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task and its environment and to develop an ini- 
tial theory of how the task is performed. The 
methods used in this second phase are varied but 
usually include structured interviews and/or 
concurrent verbal protocols based on case data. 
In the third phase, a theory is derived from the 
data and a computational model built that re- 
flects the theory. Developing a computational 


model of expert behavior requires a detailed un-- 


derstanding of the knowledge components in- 
volved in problem solving, and a mapping of 
these components into a representation scheme 
that can be used to generate expert behavior. In 
effect, the validation of the computer model is 
achieved by subjecting it to stimuli that ahuman 
expert might be expected to deal with and ob- 
serving whether the process and the outcome of 
executing the model match those of the expert. 
The real bottleneck in developing the model is 
typically the lack of adequate and correct prob- 
lem solving data from the expert. 

The fourth phase attempts to overcome this 
bottleneck. Experts are asked to interact with 
the computational model and to evaluate its per- 
formance. Their evaluation of both process and 
outcome provides additional problem solving 
data which can be used to revise the computa- 
tional model (and often the underlying theory). 
This is an iterative process and can be quite ex- 
tensive and span several years. Once the compu- 
tational model’s performance has reached a 
satisfactory level, it it tested in more controlled 
ways. This final phase usually involves having ex- 
perts provide blind evaluations of the model’s 
behavior and/or comparing the model’s_be- 
havior on a given case to that of experts evaluat- 
ing the same case. 

The research efforts reported in this paper 
have followed this general model through the 
beginning of the fourth phase. We have de- 
veloped the initial theory (conceptual model) as 
well as an initial computational model and have 
received preliminary evaluations from experi- 
enced auditors. Our initial data were obtained 
through archival sources and unstructured in- 
terviews with several auditors. The main limita- 
tion of interviews, of course, is that they can pro- 
duce “summary behavior” which may not be a 
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correct manifestation of the problem solving 
process which one might observe with real cases 
(Dhar & Pople, 1987). Nevertheless, the inter- 
views were useful in several respects. Specifi- 
cally, they helped reveal the following. 

(1) A delineation of various phases involved in 
the overall audit process. These phases consist of 
planning, tests of the control system, substantive 
testing, evaluation of evidence and formulation 
of an opinion. 

(2) Aspects of the audit process that auditors 
find particularly difficult (such as identifying 
specific characteristics of general ledger ac- 
counts that might increase the risk associated 
with an account). 

(3) Areas of the audit in which expert judg- 
ment is required. 

(4) The level of detail of the data involved in 
the various phases of the audit process. 

The level of understanding of the audit prob- 
lem gained through the interviews set the stage 
for subsequent, more targeted data gathering. 
Specifically, the first part of the data gathering 
process made it clear that it would be best too 
focus first on the audit planning phase since this 
phase results in a context within which sub- 
sequent judgments about a client are made. In 
particular, the concept of inherent risk evalua- 
tion was identified as being central to determin- 
ing how to plan an audit in the first place. For this 
reason, subsequent data gathering focused on in- 
herent risk evaluation as performed in actual 
audit situations. 

Several methods were employed in the 
second stage of inquiry. These approaches were 
all designed to enable us to identify the factors 
that auditors felt were important in assessing in- 
herent risk and to identify the process used to 
analyze inherent risk. First, we asked auditors to 
review the documentation of the planning 
activity for audits on which they had recently 
worked and to attempt to recall the data and 
reasoning they used to arrive at their conclu- 
sions concerning inherent risk. Second, we ar- 
ranged a role playing exercise in which one 
audit manager pretended he was turning over 
one of his clients to another audit manager as 
part of a normal rotation of staff. Finally, we ob- 
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served unobtrusively two actual audit planning 
meetings. The observations were followed im- 
mediately by a brief question and answer session 
with each participant. Except for one of the plan- 
ning meetings, all of the data collection sessions 
were tape recorded and transcribed for later 
analysis.! Field notes were taken during the last 
planning session and a complete analysis of the 
session was produced immediately after the 
meeting and question and answer sessions were 
over. ` 

We elected not to fashion realistic cases to 
present to the expert for solution, a method 
often used by cognitive psychologists and AI re- 
searchers for inducing and observing real expert 
problem solving behavior because of the lack of 
a well established information processing model 
of inherent risk assessment. Without the clear 
guidance of such a model, we were concerned 
that any cases we developed might not accu- 
rately capture the task environment and there- 
fore distort our theory development. 

The analysis of the data also proceeded in sev- 
eral phases. The first phase consisted of scanning 
the transcripts and field notes looking for any 
statement that would identify a factor that had 
influenced an inherent risk assessment. That is, 
statements of the general form “I altered the 
scope on that account because...” or “I felt that 
that account would be more risky because...”. 
Analysis of the resulting list of factors produced 
a simple classification scheme that grouped the 
factors into three main classes: general firm fac- 
tors that affected the overall riskiness of the firm 


but were not directly.associated with a given ac- 


count, account factors that could directly affect 
a given account and audit firm factors. Table 1 
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contains a representative sample of the factors 
identified and the classification scheme de- 
veloped. l 

The list of factors and the classification 
scheme provided evidence concerning the 
knowledge components involved in inherent 
risk assessment and the level of fineness or 
specificity with which auditors dealt with these 
components. The second phase of data analysis 
consisted of reanalyzing the transcripts and field 
notes to determine why the factors in the list 
were important and how auditors made linkages 
between these factors and levels of inherent risk. 
The specific procedure involved locating places 
in the text where inherent risk factors were 
mentioned and reviewing the surrounding text 
for any explicit comments about why the factor 
was important and how it affected risk. Such 
explicit comments were rare. Consequently, the 
bulk of our analysis in this phase involved 
generating hypotheses about the relationship of 
factors to risk based on our general business 
knowledge and confirming these hypotheses 
with the auditors. 

For example, one subject stated that he had 
been a little more skeptical about management’s 
judgments, particularly related to valuation ac- 
counts, when he discovered a potential manage- 
ment buy-out plan. He went on to state that the 
audit scope for the valuation accounts (e.g. allo- 
wance for doubtful accounts) had been in- 
creased in this case. We inferred from his com- 
ments that: 

(1) the auditor felt management motives were 
important in assessing risk because of the impor- 
tance of management judgment in determining 
and therefore potentially manipulating, certain 


TABLE 1. Classification of audit risk factors 





External to firm Within the firm 
General firm factors Nature of the industry Level of management integrity 
Direct account factors Loss of major customer High turnover ofkey personnel 
Audit firm factors Amount ofturnover inauditteam Not applicable 








‘After reading the transcript from the first planning session, the participating firm decided the information was too sensitive 
to record. They did consent to our taking notes during the second session. We felt that the importance of having access to 
real data in real situations justified the extra effort and potential data interpretation problems that are associated with field 


notes. 
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account balances; and 

(2) the auditor was using knowledge about 
the effects changes in valuation accounts would 
have on reported earnings, the relationship be- 
tween reported earnings and stock prices and 
the effect stock prices would have on the poten- 
tial cost of a management buy-out as a basis for 
developing his concern for the increased risk as- 
sociation with valuation accounts. 

Comments of this type were important in 
helping us develop the conceptual model and 
the assertions supporting it which are presented 
in the next section. In addition, an analysis of the 
inferred relationships led us to include a general 
model of fundamental economic relationships 
that exist within any firm as part of computa- 
tional model which is described later. 


CONCEPTUAL MODEL 


This section presents a description of our con- 
ceptual model of inherent risk evaluation. An 
overview of the conceptual model is provided, a 
list of specific assertions implied by the model 
are discussed and examples of evidence that sup- 
port those assertions are presented. We relate 

' these assertions in terms of a process model that 
is designed to reflect the actual process of inhe- 
rent risk assessment. The declarative knowledge 
used in the process model is described in the 
next section where we discuss the computa- 
tional model. 

Since much of our analysis described in the 
previous section was subjective, we present raw 
transcript excerpts in this section to allow the 
reader to draw their own conclusions concern- 
ing the evidence supporting the assertions. Fig- 
ure 1 presents a graphic representation of the 
process model and is intended to provide a 
global view of the inherent risk evaluation pro- 
cess, 


General description of the process model 

The process model refiects what we term a 
differential approach towards assessing the in- 
herent risk of material error in a given general 
ledger account balance. More specifically, our 
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model asserts that the auditor begins the inhe- 
rent risk evaluation process by generating ex- 
pectations for accounts balances. The auditor 
identifies changes that have occurred in the firm 
or its environment and determines how those 
changes should interact with historic trends to 
produce an expected balance in the account. In 
order to do this, we posit that the auditor uses an 
understanding of the relationships between 
firm/environmental factors and general ledger 
accounts. We term these relationships as the “in- 
ternal model of the firm” (described in the next 
section). By using this model of the firm, the au- 
ditor determines how the changes should affect 
the balances in a given account. For accounts 
that contain actual balances that are outside a 
reasonable range of the expected balance, the 
auditor reviews factors that might create or af- 
fect management incentives to misstate the ac- 
count balances (e.g. the existence of a compen- 
sation plan keyed to reported earnings). At the 
same time, the auditor considers factors that 
might impact on the likelihood that manage- 
ment could or would misstate that particular ac- 
count balance (e.g. the degree of judgment al- 
lowed in the determination of account ba- 
lances). The auditor would also consider such 
unintentional factors as the complexity of the 
transactions or accounting for a particular ac- 
count since such factors might also be responsi- 
ble for deviations from expected balances. Based 
on this analvsis, the auditor decides if additional 
evidence will be needed to determine whether 
the difference between the expected balance 
and the actual balance was due to an error in the 
expectation generating process, a legitimate re- 
sponse by management to a change in the envi- 
ronment, an unintentional error or a questiona- 


ble response by management. 


Empirical support for the model 

The inherent risk evaluation process de- 
scribed above is expectation driven, where ex- 
pectations about general ledger account ba- 
lances are based on knowledge of historical 
trends and specific changes in the audited firm’s 
environment that affect the audited firm. 
Further, the output of this evaluation process is 
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Fig 1. Conçeptual model of inherent risk evaluation. 


an analysis of factors that might explain differ- should note that all of our transcripts contain 
ences between expected and actual account ba- statements that support all of the assertions in 
lances. These factors include management’s in- the model. The fact that the transcripts were 
centives and abilities to affect the account ba- taken from data collection session. that involved 
lance as well as the complexity associated with different firms, different auditors at different 
calculating the account's balance. levels of responsibility and in different task set- 
In the remainder of this section, we provide tings provides strong support for the model. Ap- 
examples of transcript evidence that supportthe pendix A contains a short version of each asser- ` 
assertions of our conceptual model. Although tion and a sampling of taped quotations support- 
we offer only a few representative samples, we ing that assertion. The following discussion pre- 


ASSESSING INHERENT RISK 


sents a general summary of the evidence for each 
assertion. In the section describing the computa- 
tional model we discuss how specifically these 
assertions are used. 


Assertion 1: inberent risk assessments are 
generated on an account by account basts. 
There was strong evidence across all transcripts 
that auditors ultimately boil risk assessment 
down to the account level in order to be more 
precise in adjusting their audit effort. Additional 
support for this assertion was found in the audit 
procedures of the two audit firms participating 
in our study. One of the CPA firms used an op- 
tional form that records auditors’ assessments of 
inherent risk on an account by account basis. 
The other firm was beginning to use a form that 
tied account level risk'assessment to the use of 
specific audit procedures which address those 
risks. 


Assertion 2: auditors generate expectations 
concerning account balances and investigate 
balances that differ from tbese expectations In 
interviews dealing with the audit planning pro- 
cess in general, subjects made several references 
to having some notion of what they expect to 
find in an audit. Specifically, they appeared to 
have developed ranges within which they ex- 
pected given account balance values to fall. 
These ranges may be imprecise and based on the 
expected direction of change for the account as 
opposed to quantified values. 


Assertion 3: auditors generate expectations 
based on changes in events or circumstances re- 
lative to prior years. General planning inter- 
views as well as firm documents provide evi- 
dence that determining what has changed from 
last year is a critical part of initial audit planning. 
Specific examples from data gathering sessions 
that involved specific clients confirmed this ob- 
servation. In all cases, the main thrust of the au- 
ditor’s initial data gathering during audit plan- 
ning was determining what had changed from 
last year. The only exception is with large, 
sophisticated clients where the auditor has a 
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continuous relationship with the client and be- 
comes aware of these changes as they occur. 


Assertion 4: management's incentives and 
abilities to manipulate account balances are 
of special importance in assessing inberent 
risk. There was ample evidence in both specific 
and general settings that management incentives 
were a key consideration in evaluating inherent 
risk. In addition, accounts that are subject to 
management judgment (e.g. the allowances for 
doubtful accounts and obsolete inventories) 
were almost always considered more risky. This 
emphasis on management motives and abilities 
is consistent with the emphasis placed by Peat 
Marwick International on the degree of manage- 
ment judgment involved in valuing accounts and 
the degree to which the client’s financial condi- 
tion affects management's incentives discussed 
earlier. 

Additional evidence on the related issue of the 
importance of management incentives in choos- 
ing accounting methods appears in theoretical 
and empirical research in accounting. In a sum- 
mary of this research, Watts & Zimmerman 
(1986) conclude that accounting method 
choices can be driven by the existence of bonus 
plans, bond covenants and public pressures that 
affect management's incentives to report certain 
financial and operating results. 


Assertion 5: to be useful, inberent risk assess- 
ments should provide an analysis of why a 
given account is risky rather than merely a 
quantitative estimate of risk. Several quotes in 
the transcripts demonstrated the complexity of 
inherent risk assessment and the importance of 
interrelationships between accounts. While 
quantitative estimates may reflect an auditor’s 
overall feeling about an account, they do not in- 
dicate why the account is risky or what aspects 
of the account contribute to its riskiness. The 
transcripts contain several examples of auditors 
making this basic point. In order to be actiona- 
ble, an inherent risk assessment needs to indi- 
cate why a particular account is risky in a 
specific audit context. 

Apart from supporting the specific assertions 
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of the model, the transcripts indicate that they 
co-occur, The last three sections of Appendix in- 
cludes one quote that provides a description of 
. the overall process. The evidence found in the 
transcripts to support the conceptual model is 
further strengthened by the existence of sup- 
porting evidence from other researchers. Bouw- 
man found evidence that financial analysts used 
an internal model of the firm to drive their ex- 
pectations and analysis. In addition, our model is 
closely related to a similar model of auditing in- 
ferred by Johnson et af (1987) in their analysis 
of auditors’ fraud and error detection. The fact 
that they have had similar results using signific- 
‘antly different methods and subjects adds con- 
siderably to the validity of our conceptual 
model. 


DEVELOPMENT OF THE COMPUTATIONAL ~ 
MODEL 


The main purpose of developing an initial 
computational model is to help determine how 
auditors select, evaluate and combine various 
factors or cues in making inherent risk assess- 
ment. The conceptual model identifies cue clas- 
ses (e.g. observed changes, historical financial 
data, incentives ) but provides only general guid- 

‘ance concerning how individual cues within 
these classes are identified as relevant to a 
specific case, evaluated in terms of their impact 
on inherent risk and combined with other cues 
to form. a risk assessment. An initial computa- 

` tional model deals with each of these issues in a 
relatively naive way based on the initial data in 
order to produce behavior that an experienced 
auditor can evaluate. 

A computational model can be presented at 
several different levels of abstraction. Newell 
(1982) describes four main levels that can be 
used to describe such models: device, circuit, 
program and knowledge. At the knowledge 
level, the model is viewed as an agent which is 
composed of a set of goals, a set of actions and a 
body. The agent is viewed as processing knowl- 
edge to select actions to achieve goals. (Newell, 
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1982). Actions are synonymous with the con- 
cept of operators discussed in Newell & Simon 
(1972) and are responsible for affecting the 
agent’s external environment or changing the 
agent’s state ofknowledge. The following discus- 
sion presents our computational model of inhe- 
rent risk assessment as an information proces- 
sing model (Newell & Simon, 1972) at the ` 
knowledge level. (More details of the program’s 
architecture are presented in Appendix B and in 
Dhar et al, forthcoming.) Such a presentation 
requires specification of the goals and actions 
used by the agent to solve the problem under 


study (Card et al.; 1980; Newell, 1982). Figure 2 


presents an overview of the five overall goals in- 
volved in inherent risk assessment and the 
actions used to achieve those goals. The levels of 
indentation indicate a subgoal or action set de- 
signed to achieve a goal. Italicized comments 
provide conditional or iterative modifications of 
goals and/or actions. 

The remainder of this section will present a 
more detailed description of the five key actions 
in Fig. 2: (1) expectation generation (which in- 
volves cue selection); (2) evidence comparison 
and checking; (3) incentive checking; (4) ability 
checking (each of which involves both cue 


Selection and evaluation); and (5) results pre- 


sentation (which involves cue combination). 
The remaining actions involve data access and 
are self-explanatory. This initial or naive model 
does not include an action that assesses the ac- 
counting complexity or mechanical difficulty of 
generating given account balances (e.g. how 
complex the valuation: procedures or account- 
ing standards are that relate to items in the ac- 
count) because our transcripts did not provide 
sufficient evidence for us to determine how this 
assessment interfaced with other actions in the 
model. The following discussion deals with each 
of these actions in some detail. 


Expectation generation 
The primary assumption in expectation gen- 
eration is that an account balance will reflect 


` general historical trends unless something 


specific and observable occurs to alter those 
trends. Therefore, inputs to the computational 
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GOAL: Assess Inherent Risk - Firm Level 


GOAL: Assess Inherent Risk - Account Level 
(repeat until no more observed changes) 


GOAL :Develop Expected Values 


ACTION:Get Observed Changes 


ACTION :Get Historic Data 
ACTION :Generate Expected Value 


GOAL:Test Expectation 


ACTION:Get Actual Current Value 
ACTION:Compare expected to actual 
ACTION: Terminate Processing of Account 
(if expected and actual are close) 


GOAL: Assess Management’s Incentives 


ACTION :Get Affected Incentives 
ACTION:Determine Status and Direction of panes ney 


Incentives 


ACTION :Determine Management’s Ability 


ACTION :Pregent Analysis 


Fig. 2. Information processing model 


model consist of a series of observations about 
changes that have occurred in the firm or its én- 
vironment. The observations contain three 
pieces of information: 

(1) a firm rejated factor that has pee 
where a factor can be an account or some other 
quantifiable factor that influences that balance in 
a general ledger account 

(2) an estimate of the Percentage dene in 
the factor; and 

(3) abrief description of the source of that ob- 
servation. 

“For exainple; the auditor igi observe that 
the client’s market share should fall by 5% 
because a major customer of thè client was 
purchased by a competitor of the client in a ver- 


man, 1978). The model uses a naive assumption | 
that this year’s balance will be the same as last 
year’s if there are no observed changes. There- 
fore, observed changes are processed against the 
prior year’s balance to generate the expected ba- 
lance for the current year. If there are no ob- 
served changes, the prior year’s balance is used 
as the expectation for the current year’s balance. 


Evidence checking ` 

The model next checks the expected value . 
against the actual value in the’ general ledger 
data. The model incorporates a crude form of 
materiality judgment in that it uses a preset per- 
centage range to determine whether two values 
are the same or not. Values that differ by less than 


‘tical merger. However, the auditor is ultimately this amount are treated as being equal. The evi- 


concerned with the inherent risk of an account 
balance, not changes to market share. Therefore, 
the computational model uses a model of a firm 
to relate such factors as market share to sales and. 
other accounts. Figure 3 presents a diagram of a 
portion of this model. 

‘The structure of this firm model has been 
based on empirical evidence gathered in this 
study and on previous empirical studies (Bouw- 


dence checking action drops accounts. from — 
further consideration whose expected and ac- 
tual values are not materially different. 


Incentive checking 
The incentive checking action relates ac- 
counts that are left after the evidence checking 
process is completed to sets of incentiyes*that**._ 
are affected by those accounts. An ingest 
Ye a X 
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Fig. 3. Segment of the firm model. 


represent either a restriction placed on account 
“balances as part of a contract (e.g. bond coven- 
ants), contractual arrangements that tie manage- 
ment’s compensation to levels in account ba- 
lances or expectations by outsiders concerning 
levels of account balances (e.g. public expecta- 
tions of steadily growing earnings). In general, 
incentives represent agreements or expecta- 
tions that impact on management's motives to 
see that certain accounts achieve certain ba- 
lances. Examples of incentives might include a 
management bonus plan that is triggered if gross 
profit exceeds 110% of last year’s or a bond co- 
* venant that requires the current ratio to be grea- 
ter than 2 to 1. The later incentive would be rep- 
resented .within the model by the formula (> 
- current-ratio 2.0). 

Once the model has determined the proper 
relationship between the general ledger account 
balance and the arguments in the incentive for- 

‘mula, it determines which direction manage- 
ment might be motivated to manipulate the ac- 
count balance based on the current value of the 
incentive arguments. ‘That is, given existence of 
the bonus plan mentioned earlier and given that 
gross profit is actually 110% of last years value, 


the model would determine that management 
might be motivated to increase sales. This deter- 
mination is currently based on a very simple de- 
cision rule. The rule states that if the actual value 
of the incentive argument is close to the cut-off 
value, management might be motivated to in- 
crease the value of the argument over the cut-off 
by manipulating the general ledger account ba- 
lance related to the argument in the appropriate 
direction. Given the gross profit example just 
cited and an actual gross profit level near 110% 

of last year’s, the model would conclude that 
management might be motivated to increase 
gross profit by increasing sales. 

If the actual value is far from the cut-off, the 
model assumes that management might be moti- 
vated to decrease the value of the argument in 
relation to the cut-off. This simple decision rule 
is based on the observation that most accounting 
decisions over which management has some 
control merely shift values from one year to the 
next. Therefore, if the argument value is either 
too far below a cut-off or too far above a cut-off, 
management might be motivated to shift values 
to increase the chances of making the cut-off 
next year. In the gross profit exemple, if this 
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year’s gross profit is well over 110% oflast year’s 
gross profit, management would be motivated to 
save some gross profit for next year by shifting 
sales to next year to help insure they would 
make the cut-off for a bonus next year. This deci- 
sion rule is overly simplistic and yet captures the 


essence of many of the concerns expressed by. 


auditors in the transcripts. In addition, it is con- 
sistent with empirical evidence of manager be- 
havior on the related issue of choice of reporting 
methods (Healy, 1985). 


Ability checking 

The model incorporates a very simple ability 
checking procedure. Each general ledger ac- 
count is coded to indicate whether or not that 
account is particularly sensitive to management 
manipulation. Such accounts as inventory and 
receivable valuation accounts would be coded 


“yes” and less sensitive accounts like cash would ` 


be coded “no”. Ability checking merely involves 
making a note of this information for the general 
ledger account being processed. 


Presentation of results 

At this point in the processing, all the informa- 
tion needed to make an initial assessment of in- 
herent risk has been accumulated. The model 
now pulls this information together and prints it 
out in an organized fashion that reflects its mean- 
ing. The current implementation of the model 
does not have the capability to perform any real 
analysis on this information such as looking for 
patterns in the results across accounts and draw- 
ing more general conclusions. 


Test case 

Appendix C presents a trace of the computa- 
tional model’s behavior on a test case used in its 
initial development. The Appendix begins by 
presenting an example of the data entry screen 
used to input the test data as well as a printout of 
the entire data set and concludes with a trace of 
the model’s processing and output on that data. 
Italicized comments have been added to the 
trace to provide a more thorough explanation of 
the model’s activities. 
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EVALUATION OF THE MODEL 


The model has been demonstrated to several 


auditors who helped us design it. Their com- 


ments and critiques have been useful in helping 
us understand the limitations of the existing 
model and what must be done to alleviate them. 
Their general assessment was that the model was 
“going in the right direction” and that viewing 
inherent risk assessments as an expectation fai- 
lure evaluation process with explicit links to 
management’s incentives showed promise. 


‘However, they felt our initial model had several 


major limitations. These limitations are de- 
scribed below. 


Overly data driven expectation generation 
Our model’s control of attention is driven by 
the observations specified by the user. However, 
we have now come to realize that auditors’ 
analyses are to a large extent driven by the struc- 
ture of the general ledger, beginning with cur- 
rent assets and ending with extraordinary items. 
More importantly, as this “downward scanning” 
proceeds, expectations about values of accounts 
yet to be reviewed become increasingly con- 
strained. If the actual value does not fit with ex- 


. pectations, a reinterpretation of previously re- 


viewed accounts becomes necessary. This type 
of reinterpretation does not take place in our 
model. 


Overemphasts on incentives 

A second criticism, related to the one above, 
was that our model emphasized management 
motives too strongly. This effect becomes even 
more noticeable since the model ignores the re- 
lationships among firm factors described above. 
While incentives are an important determinant 
of inherent risk, we have now found that they are 
usually not considered by auditors unless the 
value of the constrained item-in thé incentive 
formula is close to the boundary. That is, our de- 
fault assumption, that account balances have 
been influenced by incentives, is not an accurate 
“one. In fact, the opposite is often the case. Also, 
the second decision rule that shifting may occur 
when the value of the constrained item in the in- 
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centive formula is some distance from the bound 
is often applied inappropriately. This decision 
rule needs to be specialized. In addition, au- 
ditors did not view all incentives as having equal 
potential impact on management’s actions. For 
example, violating a bond covenant would be 
considered more serious than not achieving a 


budgeted goal. 


Extrapolation of data 

Our model uses the prior year’s balance as a 
basis for developing an expected balance for the 
current year. Auditor’s tend to use more compli- 
cated processes that take into consideration 
recent trends in account balances. Our model 
will need to incorporate a trend extrapolation 
mechanism on which to base its expectation 
generation. In addition, audit planning typically 


occurs prior to the client’s year end. Because of . 


this, auditors do not have actual, unaudited year 
end balances with which to test their expecta- 
tions. In order to deal with situations where only 
part year is available, our model will have to be 
expanded to annualize part year balances. In ad- 
dition, some accounts do not lend themselves to 
extrapolation, e.g. extraordinary items. Our 
model will need account specific methods for 
creating expecatons for these types of ac- 
counts, 


Unintentional errors 
An important addition to the current model 
that auditors deemed necessary was some 
mechanism for dealing with unintentional errors 
based on accounting complexity and mechani- 
cal difficulty. Examples would include miscal- 
culating inventory values or accidentally failing 
to include all outstanding invoices in the ac- 
counts payable balance. Our model has no way 
to deal with such errors. In addition, our concep- 
` tual model does not deal with characteristics.of 
the firm’s management environment that could 
affect all, or at least a broad range, of accounts 
(e.g. management’s concern for the internal con- 
trol environment or high employee turnover in 
the accounting department). The auditors felt 
that these two classes of factors that influence 
the likelihood of an unintentional error in an ac- 
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count were extremely important in determining 
the appropriate inherent risk for a given ac- 
count. 


SUMMARY AND SUGGESTIONS FOR FUTURE 
RESEARCH 


The model described here isa summary ofa2- . 
year research effort, during which time we at- 
tempted to observe in as much detail as possible, 
the audit planning process with real cases. On 
the basis of our evidence, we have concluded 
that auditors do not consider it appropriate to 


-generate numerical estimates of risk on an. ac- 


count by account basis, but reason, about a 
client’s financial statements using knowledge 
about changes in the industry and-or the client, 
management’s motivations, prior track record 


‘and so on. Computer based models that attempt 


to replicate this process must be capable of mod- 


, eling this range of knowledge. While our model 


is still inadequate in the ways described above, it 
is proving to be very useful in sharpening our un- 
derstanding of the process of inherent risk as- 
sessment. In particular, giving auditors a real 
model to use (however limited) helped elicit 
data which would have been virtually impossi- 
ble using interviewing and other data gathering 
techniques. Specifically, we are continuing to 
work with the auditors who helped us design the 
model as well as other auditors to determine 
more precisely how to enhance the model in the 
following ways. 

(1) Provide attention digesting mechanisms - 
that follow the normal layout of a general ledger 
but that also recognize basic relationships be- 
tween accounts and either defer judgment on a 
given account until-all related accounts have 
been evaluated or reevaluate a given account 
when additional evidence is produced. 

(2) Revise the treatment of management in- 
centive issues to more closely match the au- 
ditor’s treatment. 

(3) Produce expected values based on trends 
in account values and account specific knowl- 
edge and provide the ability to deal with part 
year data. 
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(4) Provide mechanisms for dealing with 
mechanical or unintentional errors. 

(5) Provide an analvsis of the evidence 
gathered by the model that reflects patterns in 
the data and helps direct the auditor toward 
specific areas where additional data may be 
needed to resolve differences between ex- 
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pected and actual account balances. 

Empirical evidence from auditors will be 
needed to make these enhancements to the 
computational model. However, the existing 
model provides a useful framework for soliciting 
the types of data needed to address these issues. 
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APPENDIX A. EXAMPLES OF STATEMENTS SUPPORTING MAJOR ASSERTIONS 


Assessment of the account level 
The following two quotes deal with auditor's reaction to a new workpaper that was being tested by one 
of the CPA firms involved in this research. The workpaper required auditors to rate the inherent risk of each 
account. 
“I think it probably lays out on paper how one’s mind works; I probably think that way but I don’t 
need that workpaper to help me think that way because I already do.” 
“. which will take the financial statement captions and try to isolate whether we think it's high 
flow, low flow, medium flow, you know, what type of risk we think is associated with it in better at- 
tempt to design appropriate audit approaches and I think ld want to do that here.” 


Expectation generating process 
General planning interview: 
“So you almost have a preconceived notion of what you are going to see and as long as you see that 
you got the explanation ready for it so you don't have to do much digging.” 
Specific client planning meeting: 
. S1 — "We, it just seems unusual with a high volume company like this you think you would de- 
velop percentages, you know, this seems like with the type of business they would be in you could 
develop reserve percentages but...” . 
S2 — “Actually, you would think so but their history of write-offs has not been, has not been signifi- 
cant.” 


Differential model 
General planning meeting: 

“So what I end up doing is concentrating on what's different or unique about this year versus last 

year. That’s where you get into what's changed.” 
Specific client planning meeting: 

“Normally [XYZ, co.] is so simple when you think about it, debt, totally findncing inventory and 
receivables with very little investment and fixed assets that I would have just honed in on inventory 
and accounts receivable, compared it to last year’s audit report which we have, which I brought into 
the mecting and I would have said, ‘well, is it level, went up or did it go down. Receivables up or 
down.’ That type of thing.” 


Management's incentives and abilities 

General planning interviews: . 

i “Why, when someone says to me, ‘What is the inherent risk on the engagement?’, I first think of 
people. That’s what I do. And the strength of management and their ability of management to over- 
ride what ever the systems create or to influence whatever the systems create, dramatically so.” 

“.,.a sophisticated, multinational organization that’s well controlled and has very formal budgetary 
and planning controls in its normal reporting, I think the thing that you've got to, the key factor ts 
what is the likelihood of management overriding those controls. .. The way company management ` 
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. is compensated comes into play because, to the extent that even financial management is rewarded 
based on the achieved level of reported results, it creates an incentive to make those reported results 
reach the target levels.” 


Need for explanations of why account ts risky 
General planning interviews: 

“We look for interrelationships. We look to see if sales are down, you would expect receivables 
to be down. If sales are up and you see that receivables are down, you might ask yourself, “Has he 
done a better job in collecting the accounts? Did he offer a special discount program?” And if sales 
are up and receivables are down, has he invested the money in inventory or is he holding it in cash. 
You know, are all those things consistent?” 

“So what’s so critical about it and why. In not very general terms but very specific terms. Out of 
this one hundred million dollar balance what piece am I really concerned about. Okay, there’s prob- 
ably five million out of that one hundred million that Fm really concerned about but how do I get 
there?” 


General support for the conceptual model 

“I think the process you go through to obtain that knowledge really is to gain an understanding of 
the client’s business, an understanding of the client, an understanding of how the fluctuations in the 
economy might affect a client’s business. You compare the client’s business to other businesses in 
the same industry to see if they are having consistent operating results and if not if there are logical 
reasons for it and if they are having consistent operating results if that is what you expected. I mean 
you develop expectations in your mind of what you expect to see and to the extent results don’t con- 
form to that yet. You begin asking questions and then you obtain the knowledge.” 


APPENDIX B. DESCRIPTION OF SYSTEM ARCHITECTURE 


The primary concerns in choosing an architecture for the model were psychological validity and flexibil- 
ity. Alchough it is difficult if not impossible to determine the psychological validity of various representation 
schemes at program level (Anderson, 1978), there is considerable theoretical and empirical support for 
separating declarative knowledge from procedural and representing declarative knowledge as a network 
and procedural knowledge in the form of production systems (Anderson, 1983). Therefore we elected not 
to use available systems based strictly on rule formalism that other researchers have used (see, for example, 
Dungan & Chandler, 1985; Hansen & Messier, 1986; Steinbart, 1987) and decided to develop our modei 
directly in the LISP progamming language (see for example, Biggs & Selfridge, 1986; Shpilberg & Graham, 
1986). This approach also allowed us greater flexibility (see Dhar & Pople, 1987, for a discussion of rule 
based systems’ ability to adapt). We further elected to use an object oriented approach to add structure to 
the program while retaining maximum flexibility (see Stefik & Bobrow, 1986, for a discussion of object 


oriented programming). 


Procedural knowledge 

Procedural knowledge in the model was represented in four different ways: as rules, productions sys- 
tems, checklists and procedures. Each representation method was implemented as a class of structured ob- 
jects, . 


Rules. Rules primarily resided inside production systems but can also be included in checklists. Rule ob- 
jects contain five fields: condition, action, variables, trace message and explanation. The condition and 
action fields are straightforward and represent the “if” clause and “then” clause of “if... then” rules. In keep- 
ing with the structured object formalism, rules are allowed to have local variables. However, this feature is 
seldom used. The trace message allows the model to display relevant information when the rule fires. This 
is useful for program debugging and for matching the model’s process behavior against a human subject’s 
verbal protocol. The explanation field is used like a comment line and facilitates debugging the model. 
Trace messages and explanation fields are common to all procedural objects and will not be discussed 
further. : 
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Production systems. Production systems are rule packets that are interpreted sequentially until one fires 
and then the process begins at the beginning of the packet again. The processing terminates when no rules 
fire or when a rule tells the production system to stop. Production systems contain four fields: rule list, trace 
message, variables and explanation. The rule list is merely that, a list of rules that are checked sequentially 
until one fires. The variables represent the production system's local working memory. The values for these 
variables can be set by passing the production system information when it is called or by action of rules. 


Checklists. Checklists represent a specialized form of production system. They execute a list of pro- 
-cedural objects (rules, other checklists, production systems or procedures) sequentially until all have com- 
‘pleted their activity or one of them tells the checklist to stop. In addition, procedural objects can tell the 
checklist to skip other procedural objects. Checklists contains four fields: list of procedural objects, vari- 
ables, trace message and explanation. The variables represent the checklist’s local working memory. 


Procedure. Procedures are merely nice ways for the model to call LISP procedures directly. They contain 
four fields: variables, trace message, explanation and procedure definition. We have attempted to minimize 
use of these objects in that knowledge is not as explicitly represented as it is in the other procedural objects 
and have limited their use to complex mathematical calculations. 


Summary. The procedural objects can be linked together dynamically in call chains similar to function 
calls in LISP or any programming language which means the model has great flexibility. However, they are 
data structures that conform to certain standards of explicit knowledge representation which gives the 
model access to its own knowledge. This feature will be useful in future enhancements of the model that 
deal with improved explanation and learning. 


Declarative knowledge 

The declarative knowledge of the model primarily Involves the firm model. The firm model is rep- 
resented as a node and link network where both nodes and links are implemented as structured objects. The 
nodes in the network or firm model elements, are complex objects that contain a variety of fields. The most 
important of these are sets of formulae that determine how the values of the clement can be calculated from 
other elements. There are also ficids for various types of values (e.g. percentage change, current, prior). The 
formulae represent the links to other nodes and are themselves represented as structured objects. These 
formulae objects contain methods for calculating the various types of values associated with a firm model 
element. 

In addition to these permanent data structures, the model utilizes temporary data structures to accumu- 
late the results of the model’s processes and to store case specific data. Data structures called expectations 
store the expected and actual values for a given firm model clement as weil as the results of the incentive 
and ability checking process. Complete histories are also maintained of how each value was calculated to 
facilitate explanation. Observation objects contain the auditor entered observations concerning explicit 
changes in a given firm model element. They contain three fields: change value, firm model element and a 
comment. The comment is not currently used by the model except to display the source of expected value 
calculations. Incentive objects contain formulas that indicate boundaries that management might want firm 
model element values to cross. Incentive objects are case specific. 

The general ledger data for each case is not represented internally in the model but read in with each case 
from a standard ASCII file. This feature facilitates interfacing the model directly with existing data bases. 


General data 

The model is menu driven and there are some simple editing features to allow for easy entry of case 
specific data. The model reported in this paper runs on an IBM or compatible micro computer with 512K 
CPU and two floppy disk drives. A version currently under development requires a hard disk and 5 mega- 
bytes of CPU memory. The new version resides on a COMPAC portable II microcomputer and uses a Gol- 
dhill 286 development package. We have found that use of a portable microcomputer is a real advantage 
in that the model can be easily demonstrated to auditor's in their offices without any special equipment. 
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APPENDIX C. TRACE OF MODEL BEHAVIOR 


Three types of data are entered into the model by the user: observations, incentives and general ledger 
account balances. General ledger balances are input from a disk file while observations and constraints 
are entered from a menu driven editor. The following is an example of one tnput screen used when the 
user entered an observation, bold items were entered by the user. 


Adding a new observation to TEST case 

Please enter requested information when prompted. 

Firm or environmental element effected: paid in capital 

Amount of expected change as a %: 5 

Source of expected change: Public sale of common stock 

(E)nter another observation or (Quit? Q 

A test case was built using this menu screen, a similar screen for incentives and a disk file for general 
ledger data. The following coptes of model display screens present the total data set for the test case. The 
model displays a natural language translation of the internal data structures. 

The following observations are on file: 

(1) an expected 5% change in paid in capital based on public sale of common stock; 

(2) an expected 7% change in common stock based on public sale of common stock; 

(3) an expected — 10% change in firm’s market share based on purchase of customer by competitor; 

(4) an expected —4% change in average manufacturing labor rate based on new union contract; and 

(5) an expected —5% change in total market demand based on trade publications. 

The following incentives are on file: 

(1) maximum gross profit level for the management compensation package which is based on the restric- 
tion that the current year’s gross profit is less than or equal to the prior year’s gross profit times 1.15; 

(2) minimum gross profit cut-off for the management compensation package which is based on the re- 
striction that the current year’s gross profit is greater than or equal to the prior year’s gross profit times 1.1; 

(3) debt to equity restriction due to a bond covenant which is based on the restriction that the current 
year’s long term debt to equity is less than 0.5; 

(4) working capital restriction due to a bond covenant which is based on the restriction that the current 
year’s current ratio is greater than or equal to 2; and 

(5) public expectation for increasing earnings which is based on the restriction that the current year’s 
net income is greater than or equal to the prior year’s net income. 


The following general ledger data is on file. 


Account name Current value Historic value 
Cash 1144 1140 
Short term investments ; 1750 1750 
Accounts receivable 14,542 11,362 
Ending inventory 39,364 35,388 
Prepaid expenses 1087 1000 
Land 1268 1268 
Buildings 7114 7100 
Machinery 11,109 11,000 
Furniture 1674 1700 
Total other assets 844 840 
Current portion of iong term debt 579 , 579 
Accounts payable 15,166 15,000 
Federal income taxes payable 2326 2300 
Total long term'debt ; 13,681 13,680 
Convertible stock d 311 311 
Common stock 5581 5218 
Paid in capital 7846 7499 


Continued over 
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Continued 

Account name Current value Historic value 
Ending retained earnings 24,594 19,406 
Dividends 833 833 
Beginning retained earnings 19,406 12,239 
Income tax expense 5724 0 
Sales 121,270 123,000 
Manufacturing labor 43,691 45,000 
Manufacturing materials 17,477 17,000 
Manufacturing overhead 26,215 26,000 
Ending inventory 39,364 35,388 
Beginning inventory 35,388 36,388 
General and administrative expenses . 26,118 26,000 


The following ts a trace of the system’s activity given the above test data. 

Generating expected change values for general ledger accounts based on 5 observations on file. 

The model first propagates the.cbange values from tbe 5 observations through its internal model of a 
firm to assess the impact of those changes on general ledger account balances. 

Checking expected changes against actual change values. 

Once expected changes in general ledger account balances bave been calculated based on the observa- 
tions and firm model, the model calculates the actual change using the general ledger data for eacb gen- 
eral ledger account 

Determining mangement motives in relation to unfulfilled expectations based on 5 constraints on file. 

Determining degree of latitude for management manipulations. 

Next, the model determines the relationsbip between the unfulfilled expectations and the list of man- 
agement incentives on file. 

I have completed my analysis, please hit any key to review the results. 

The model concludes tts analysts by presenting tbe data that bas accumulated during ibe analysts. The 
following is a printout of all the data accumulated for one general ledger account in this test case. 

I did not expect a change in accounts receivable. 

The actual change in accounts receivable is 28%. 

This account is particularly sensitive to management override. 

This account affects the following constraints. 

A working capital restriction due to a bond covenant which is based on the restriction that the current 
year’s current ratio is greater than or equal to 2. Management may be motivated to decrease accounts re- 
ceivable because current ratio is 60.2% greater than 2 which is far enough from the limit to encourage shift- 
ing items to next year. 
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